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CHAPTER  I:  PROBLEM,  THEORY  AND  RELATED  RESEARCH 

Problem  Statement 

One  of  the  obstacles  to  an  effective  theory  of  instruction  is 
the  inability  to  identify  the  relevant  variables  involved  in  instruc- 
tion — both  the  variables  centered  in  the  learning  situation  and  the 
variables  centered  in  the  learner.  Ideally,  educators  should  be  able 
to  specify  the  optimal  instructional  method  for  a given  student  in  a 
given  situation.  Of  course,  they  cannot  at  present  do  this  and  prob- 
ably they  will  never  be  able  to  do  it  without  some  qualification. 
However,  it  is  possible  to  move  in  this  direction  by  identifying  as 
many  variables  as  possible  which  affect  the  learning  situation,  and 
then  determining  both  their  effect  and  the  effect  of  their  inter- 
actions. 

The  recent  advances  in  auto- instruction  — particularly  in  com- 
puter-based systems  — affords  us  the  opportunity  to  experimentally 
manipulate  variables  and  to  determine  their  effects  on  learning  to 
a degree  which  is  impossible  in  conventional  learning  systems.  Un- 
fortunately, while  research  on  auto-instruction  has  repeatedly  shown 
that  such  techniques  do  facilitate  learning,  few  conclusions  can  be 
drawn  about  auto-instruction  in  general.  Frequently,  there  has  been 
little  or  no  consistency  of  findings  about  which  variables  are  the 
important  ones  or  what  the  effects  of  the  variables  studied  are 
(Silberman,  1962;  Schramm,  1964).  Consequently,  while  researchers 
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such  as  Stolurow  have  recommended  that  auto-instructional  and  con- 
ventional methods  be  alternated  in  the  classroom  (1963),  there  are  no 
guidelines  for  teachers  to  follow  in  implementing  auto-instruction. 

The  purpose  of  the  present  study  is  to  move  toward  identifying 
and  isolating  some  of  the  variables  which  influence  programmed  learn- 
ing, with  the  ultimate  hope  of  giving  the  teacher  guidelines  for  imple- 
mentation of  these  and  other  techniques  and  with  an  eye  on  the  future 
development  of  a theory  of  instruction.  The  variables  involved  in 
any  learning  situation  are  numerous  and  a single  study  cannot  hope  to 
manipulate  more  than  a few  of  them.  For  example,  Fry  (1963)  has 
.itemized  212  variables  in  programmed  learning. 

The  more  salient  independent  variables  involved  in  verbal  auto- 
instructional  situations  can  be  roughly  divided  into  five  categories 
with  a minimum  of  overlap.  These  categories  are  (1)  Structural  or 
format  variables,  (2)  Content  variables,  (3)  Learner-centered  variables 
(4)  Structure-learner  interaction  variables,  and  (5)  Content-learner 
interaction  variables.  The  structural  and  content  variables  are 
material-centered  and  are  not  dependent  upon  the  learner  in  any  way. 

The  learner-centered  variables  are  likewise  independent  of  the  struc- 
ture and  content.  The  two  interaction  categories,  however,  are  depen- 
dent both  upon  the  learner  and  upon  the  materials. 

The  task  of  studying  all  possible  combinations  of  the  variables 
involved  in  auto-instruction  is  extremely  impractical,  if  not  impossible 
Even  if  we  assumed  only  three  critical  levels  of  each  variable,  the 
number  of  different  combinations  of  the  variables  identified  by  Fry 
would  be  3212  (a  number  containing  106  digits).  Yet,  three  crucial 
levels  areprobably  an  underestimation  for  many  of  these  variables. 


Consequently,  the  number  of  variables  included  in  a single  study  has 
to  be  greatly  delimited. 

In  the  present  study  the  single  material-centered  variable  of 
frame  size  in  a linear  auto-instructional  text  was  manipulated  in  a 
one-way  design.  Here  frame  size  is  defined  as  the  amount  of  material 
presented  between  elicitations  of  responses.  While  some  theorists 
have  maintained  that  very  short  frames  with  small  steps  between  frames 
will  provide  optimal  learning  for  all  individuals  (e.g.,  Skinner, 

1958) , there  is  little  empirical  evidence  to  support  this  view.  In 
fact,  Deterline  (1967),  states  that  steps  which  are  too  small  "can 
actually  interfere  with  the  desired  learning"  (p.  212).  This  is  in 
agreement  with  Pressey's  position  that  "...detailed  small-step  pro- 
gramming may  be  exceedingly  useful  — but  surely  not  for  everybody 
on  everything"  (Pressey,  1960,  p.  503).  Several  studies  have  in- 
vestigated the  size  of  the  step  between  frames  (e.g.,  Coulson  and 
Silberman,  1960,  1961;  Shay,  1961;  Smith  and  Moore,  1962;  and  Kapel, 
1965) , but  there  have  been  very  few  studies  of  the  amount  of  material 
within  frames  (e.g.,  Krumboltz  and  Kiesler,  1965)  and  these  are  fre- 
quently confounded  by  additional  variables  (e.g.,  Briggs,  et  al.,  1962). 

It  should  be  pointed  out  that  frame  size  cannot  be  manipulated 
independently  of  all  other  material-centered  variables.  Obviously, 
the  frequency  of  response  is  a function  of  frame  size.  If  more  ma- 
terial is  presented  between  overt  responses,  it  follows  that  there  will 
be  fewer  responses  per  unit  time  and  per  unit  material.  Since  feed- 
back and  the  assumed  associated  reinforcement  are  contingent  upon  re- 
sponses, the  frequency  of  these  variables  will  also  vary  with  the 
frame  size.  To  a lesser  extent,  redundancy  of  material  and  cues  or 


-4- 


prompting  will  also  be  in  part  dependent  upon  frame  size.  Thus,  although 
only  frame  size  will  be  directly  manipulated,  other  variables  will  also  vary. 

The  rationale  for  selecting  frame  size  for  study  is  based  in  large 
part  upon  what  appears  to  be  a gap  between  reinforcement  theory  and  the 
mechanics  of  auto-instruction.  Skinner’s  auto-instructional  techniques 
grew  out  of  his  operant  studies  of  reinforcement  with  animals.  In  animal 
studies,  a reasonably  constant  stimulus  situation  elicits  constant  re- 
sponses which  are  usually  either  single  discrete  acts  (e.g.,  bar  pressing 
by  rats),  or  successive  approximations  to  particular  acts.  Auto-instruc- 
tion, however,  does  not  entirely  fit  the  operant  paradigm  as  used  in  the 
animal  studies,  since  neither  the  stimuli  nor  the  responses  remain  con- 
stant throughout  the  program  and  strictly  speaking  are  unique  for  each 
frame.  Ignoring  content,  there  is  little  question  that  the  act  of  overtly 
responding  can  become  an  operant  under  stimulus  control;  however,  the 
interest  in  programmed  instruction  is  not  the  act  of  responding  per  se, 
but  the  specific  content  of  the  response.  Hence,  if  we  are  to  assume  that 
the  learning  of  content  is  under  stimulus  control  and  is  reinforced  by 
feedback,  we  have  to  conclude  that  one-trial  learning  is  taking  place. 

That  is,  we  have  to  assume  that  a single  reinforcement  of  a specific 
response  is  sufficient  to  maintain  that  response  in  the  student’s  reper- 
toire over  an  indefinite  period  of  time.  If  this  is  the  case,  an  addi- 
tional problem  arises  with  small  frames  in  that  the  responses  will  per- 
haps only  be  related  to  the  limited  stimulus  content  of  that  frame 
(Deteriine,  1967). 

While  one-trial  learning  is  a possible  explanation  there  are  other 
problems  with  programmed  instruction  fitting  the  operant  paradigm.  For 


-5- 


instance,  auto-instruction  fails  to  match  the  operant  paradigm  because 
the  overt  responses  are  elicited  by  specific  prompts  or  cues  within  the 
frame.  This  situation  is  inconsistent  with  Skinner's  position  in  which 
the  operant  responses  are  emitted  rather  than  elicited.  Lumsdaine 
(1962)  has  discussed  this  point  and  suggests  that  programmed  instruction 
may  better  fit  contiguity  theory  than  it  does  reinforcement  theory  and 
that  the  relationship  between  the  cues  in  the  frame  and  the  responses 
parallels  the  classical  conditioning  paradigm.  Scandura  (1966)  suggests 
that  learning  principles  discovered  in  the  laboratory  may  have  little 
application  in  teaching  because  of  the  differences  in  the  situations. 

He  points  out,  for  example,  that  " 'shaping,'  the  operant  technique  for 
modifying  overt  behavior,  has  not  been  adequately  specified  with  respect 
to  meaningful  verbal  (i.e.,  symbolic)  material"  (p.  141).  These  con- 
siderations suggest  that  operant  reinforcement  principles  cannot  be 
directly  applied  to  auto-instruction,  and  calls  into  question  the 
practice  of  extrapolating  from  laboratory  operant  studies  to  human 
classroom  learning  situations. 

Even  if  reinforcement  theory  can  be  applied  directly  to  auto- 
instruction it  has  not  been  demonstrated  that  feedback  to  the  student 
as  to  the  correctness  or  incorrectness  of  his  response  does,  in  fact, 
provide  reinforcement  to  him.  If  the  student's  response  is  incorrect, 
viewing  the  correct  response  is  not  a parallel  situation  to  the  labora- 
tory counterpart  where  some  organism  fails  to  emit  a correct  response. 

In  the  auto-instruction  situation,  viewing  the  correct  response  provides 
a further  cue  for  a covert  or  implicit  correction  of  the  previous  incor- 
rect response.  In  the  laboratory  situation  an  incorrect  response  usually 
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produces  no  overt  reaction  at  all  from  the  experimenter  of  the  apparatus 
(except  in  avoidance  studies) . 

If  the  student's  response  is  correct,  seeing  the  correct  response 
does  not  necessarily  constitute  reinforcement  for  the  student.  Briggs, 
et  al.,  (1962)  point  out  that  for  easy  frames  overt  responses  and  the 
resultant  feedback  are  not  always  effective.  In  a study,  reported  by 
Briggs  and  his  colleagues  (1962) , students  who  only  read  frames  and  did 
not  overtly  respond  had  superior  test  scores  and  superior  time-efficiency 
for  auto-instructional  frames  of  low  difficulty  (based  on  the  degree  of 
cueing)  over  those  students  who  had  to  overtly  respond  to  the  same  frames. 
Licklider  (1962)  states  that  on  a computer-assisted  instructional  system 
the  unmotivated  student  found  scores  and  comments  essential  in  working 
through  programs,  while  the  highly  motivated  students  found  these  items 
much  less  important  and  concentrated  more  on  speed  and  convenience.  Thus, 
it  becomes  obvious  that  feedback  cannot  be  categorically  classified  as 
being  reinforcing.  Instead,  its  reinforcing  qualities  appear  to  be  a 
function  of  the  characteristics  of  the  particular  student  in  relation  to 
the  programmed  content.  Further,  a study  by  Krumboltz  and  Kiesler  (1965) 
suggests  that  reinforcement  is  present  in  programmed  instruction  with  or 
without  feedback. 

An  additional  problem  with  the  reinforcing  qualities  of  feedback  is 
that  programmed  instruction  does  not  make  allowances  for  the  student's 
previous  knowledge  in  the  particular  subject  area;  rather  all  students 
must  begin  at  the  same  point  and  are  requested  to  overtly  respond  even 
though  they  may  be  quite  familiar  with  the  material  and  find  it  extremely 
easy.  While  Skinner  (1958)  saw  this  situation  as  advantageous,  compelling 
the  student  to  respond  may  actually  impede  the  learning  process.  If  pro- 
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grammed  learning  does  involve  the  shaping  of  behavior  (Skinner,  1958), 
requiring  the  previously  knowledgeable  student  to  work  through  the 
program  involves  the  relearning  of  the  sequential  steps  rather  than  the 
relearning  of  the  final  behavior.  Consequently,  such  redundancy  does 
not  present  the  same  situation  as  repeated  trials  involving  the  same 
response.  The  analogous  laboratory  animal  situation  would  be  exempli- 
fied by  having  a pigeon  which  has  been  trained  to  turn  clockwise  circles 
relearn  this  response  through  the  original  process  of  selectively  rein- 
forcing small  approximations  to  clockwise  circles.  Such  a procedure 
would  be  inefficient  both  in  terms  of  the  experimenter's  time  and  in 
terms  of  the  pigeon's  time  and  would  probably  interfere  with  the  pigeon's 
complete  circle  turning;  yet  this  activity  is  what  we  ask  students  to  do 
when  we  have  them  go  through  auto-instructional  material  involving  con- 
tent with  which  they  are  already  familiar. 

Hence,  the  study  is  a test  of  the  hypothesis  that  the  programmed 
material  must  be  broken  into  small  parts,  each  of  which  elicits  a re- 
sponse which  is  subsequently  reinforced.  The  specific  problem,  then, 
is  to  determine  if  efficiency  of  learning  is  dependent  upon  frame  size. 
The  study  is  exploratory  in  nature;  no  specific  alternative  hypothesis 
is  being  directly  tested. 

Related  Research 

Frame  size  defined  as  the  amount  of  material  presented  between 
responses  has  received  little  attention  in  the  literature.  The  ''step- 
size"  studies  have  usually  been  concerned  with  the  difficulty  of  giving 
a correct  response  at  any  point  in  the  program  (Coulson  and  Silberman, 
1960,  1961;  Shay,  1961;  Smith  and  Moore,  1962;  and  Kapel,  1965)  rather 
than  with  the  length  of  the  material  between  responses.  However,  "step- 


size"  has  had  at  least  four  different  meanings:  (a)  the  length  of 

the  program  frame  (i.e.,  frame  size  as  defined  in  the  present  study); 

(b)  the  difficulty  of  giving  a correct  answer  (size  of  step  between 
frames) ; (c)  the  complexity  or  length  of  the  response  required  of  the 
student  before  he  receives  feedback  (Evans,  1965).  These  different 
usages  have  resulted  in  some  confusion  concerning  the  results  of  size 
of  step"  as  they  use  it.  Much  of  the  literature  on  this  topic  has 
not  bearing  on  the  present  study,  and  will  be  omitted  here. 

One  type  of  study  which  superficially  resembles  the  present  one 
is  the  comparison  of  short-framed  Skinnerian  programs  (extrinsically 
sequenced)  with  the  longer-framed  branching  programs  (intrinsically 
sequenced).  However,  unless  unusually  strict  controls  are  effected,  the 
contributions  of  frame  size  cannot  be  separated  out  from  the  other 
variables  such  as  mode  of  responding  and  method  of  sequencing.  As 
Crowder  points  out,  "Linear  and  instrinsic  programming,  while  having 
some  superficial  similarities,  are  basically  different  in  approach, 
intention  and  rationale"  (1963,  p.  254).  Consequently,  studies  of  this 
type  will  not  be  considered. 

Another  type  of  study  which  appears  to  be  similar  is  the  comparison 
of  autc-instruccionai  programs  to  conventional  text  materials.  While  in 
a sense  these  studies  are  a comparison  of  different  frame  lengths,  there 
are  also  qualitative  differences  between  auto-instructional  programs  and 
conventional  material.  Conventional  materials  require  no  overt  responding 
and,  thus,  require  no  feedback  or  controlled  reinforcement.  Thus,  these 
studies  are  not  strictly  a study  of  different  frame  size. 
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In  one  example  of  a study  in  which  auto-instruction  was  compared 
to  a conventional  text,  Briggs  et  al*  (1962)  used  four  types  of  material 
to  teach  a lesson  on  the  structure  and  the  function  of  the  United  Nations. 
Two  forms  of  the  material  were  conventional  (one  with  review  material  and 
one  without)  and  two  forms  were  Skinner-type  programs  (one  requiring  the 
first  letter  of  the  response  word  only,  and  the  other  requiring  the  com- 
plete response  word).  The  investigators  found  that  the  conventional 
groups  were  more  efficient  in  terms  of  time  than  were  the  Skinnerian 
groups.  On  the  criterion  tests  (multiple  choice)  the  Skinnerian  groups 
excelled  when  the  correct  answers  were  identical  to  the  responses  learned 
in  the  program,  but  the  conventional  groups  excelled  when  the  correct 
answers  were  different  than  the  responses  in  the  programs.  Briggs  and 
his  associates  concluded  that,  "if  you  employ  a Skinner- type  program, 
do  not  expect  even  a moderate  level  of  learning  of  program  content  which 
is  not  elicited  during  training"  (p.  92). 

In  a second  study  reported  by  Briggs  et  aL  (1962) , one  group  of 
students  worked  through  linear  programs  which  required  overt  respond- 
ing, while  another  group  read  the  same  material  with  the  responses 
filled  in.  The  reading  group  surpassed  the  overt  group  in  time-effi- 
ciency for  all  of  the  materials,  and  surpassed  them  in  test  scores 
for  the  easy  frames  (based  on  the  cues) . This  study  suggests  that  for 
those  frames  which  provide  easy  prompts  for  a student,  overt  respond- 
ing is  an  ineffectual  method  compared  to  reading.  In  the  present 
study,  no  attempt  has  been  made  to  control  prompting,  but  it  was  anti- 
cipated that  the  effect  reported  by  Briggs  et  al.  will  be  present  across 
material  versions  representing  different  frame  sizes. 
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Krumboltz  and  Kiesler  (1965)  studied  the  effects  of  partial  rein- 
forcement in  programmed  instruction  by  varying  frame  size  as  defined 
in  the  present  study  and  by  varying  the  availability  of  feedback.  They 
used  six  versions  of  a rather  short  (177  - frames)  program  on  interpret- 
ing educational  test  results.  These  versions  were: 

1.  The  standard  program  in  which  a question  was  asked  on  every 
frame  with  the  answers  provided. 

2.  A question  on  every  frame  with  the  answer  provided  for  every 
fifth  frame. 

3.  A question  on  every  frame  with  the  answer  provided  for  every 
tenth  frame. 

4.  A question  only  on  every  fifth  frame  with  the  answers  provided. 

5.  A question  only  every  tenth  frame  with  the  answers  provided. 

6.  No  questions  asked  with  all  frames  consisting  of  declarative 
statements . 

These  versions  presumably  are  arranged  in  descending  order  of  the 
reinforcement  which  they  provide.  Criterion  test  results  of  the  high 
school  students  in  the  study  tended  to  support  the  hypothesis  that  the 
more  reinforcement  the  better  the  test  scores,  with  the  exception  that 
the  mean  for  the  fifth  group  was  lower  than  that  for  the  third,  fourth, 
and  sixth  groups.  The  researchers  did  an  analysis  of  variance  on  the 
test  scores  which  produced  a significant  F (F  = 3.05;  df  = 5,  77; 
p < .05)  for  the  test  given  immediately  upon  completion  of  the  materials 
to  half  of  the  subjects.  They  did  not,  however,  make  individual  com- 
parisons of  the  means  to  determine  which  values  differed.  A second 
analysis  of  variance  using  scores  on  a delayed  test  (approximately  two 
months)  given  to  the  other  half  of  the  subjects  failed  to  produce  a 
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significant  F. 

Unfortunately,  examination  of  the  immediate  test  data  presented 
by  Krumboltz  and  Kiesler  shows  that  the  variances  are  not  homogeneous. 

By  squaring  the  reported  standard  deviations,  computation  of  Hartley's 
F-max  statistics  gives  a value  of  9.46,  which  for  six  groups  and  20  df, 
is  significant  at  the  .01  level. * While  the  analysis  of  variance  F- 
test  is  usually  robust  in  the  face  of  a lack  of  homogeneity  of  variance, 
this  robustness  fails  in  the  face  of  unequal  sample  sizes.  Generally, 
a lack  of  homogeneity  will  inflate  the  F-value  and  produce  significance 
where  otherwise  it  would  not  exist  (Cochran  and  Cox,  1957;  Edwards,  1960). 
Hence,  it  is  quite  possible  that  if  the  scores  were  corrected  for  the  lack 
of  homogeneity,  the  obtained  F would  not  be  significant. 

While  the  reported  evidence  suggests  that  the  larger  the  frame  size, 
the  better  the  performance,  this  may  be  due  only  to  poor  statistical 
handling  of  the  data.  In  addition,  the  experimenters  had  no  control 
over  the  student's  acquisition  experiences  and  as  reported  above,  the 
program  was  quite  short.  These  considerations  cast  doubt  on  the  validity 
of  the  findings. 

In  summary,  there  has  been  little  research  on  the  variable  of 
frame  size  and  the  evidence  is  inconclusive.  While  Briggs  (1968) 
in  his  review  of  "Learner  Variables  and  Educational  Media,"  states 
that  the  Skinnerian  theory  that  small  steps  are  needed  has  not  gone 

*Use  of  the  F-max  statistic  assumes  equal  n's.  Since  the  n's  are 
unequal  the  largest  number  of  degrees  of  freedom  was  used  which  gives  a 
more  powerful  test  than  any  other  number.  However,  even  using  the  mean 
number  of  degrees  of  freedom,  13.8,  the  F-max  statistic  is  significant 
at  the  .01  level. 
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unchallenged,  he  does  not  cite  one  study  which  is  concerned  with 
the  length  of  the  frame  as  defined  in  the  present  study.  In  1962, 
Silberman  stated  that  the  research  on  programmed  instruction  at 
that  time  had  barely  scratched  the  surface  and  that  "beyond  demon- 
strating that  a carefully  written  set  of  materials  will  teach  if 
a student  will  spend  enough  time  studying  them,  we  have  little 
unequivocal  evidence  for  the  principles  of  programmed  instruction." 

Although  six  years  have  passed  since  then,  the  situation  has 
changed  little.  Many  of  the  difficulties  have  been  in  the  studies 
themselves;  ample  examples  of  these  deficits  are  presented  elsewhere 
(Silberman,  1962;  Tuel,  1966;  Flynn,  1967).  Frequently,  studies 
have  not  been  well  designed  or  well  conducted  and  many  extraneous 
sources  of  variation  have  confounded  the  results.  A frequent 
weakness  is  that  many  investigators  use  extremely  short  programs 
which  may  not  permit  a definite  pattern  to  establish  itself  or  for 
novelty  effects  to  dissipate. 

Objectives 

The  purpose  of  the  study  was  to  explore  an  aspect  of  the  general 
problem  of  identifying  optimal  learning  materials  for  given 
individuals.  The  specific  question  being  asked  is  the  following: 

"Is  there  an  optimal  frame  size  for  learning  to  occur  in  linearly 
sequenced  auto-instructional  programs?"  Following  from  this  ques- 
tion, "If  the  frame  size  in  linearly  programmed  instructional 
material  is  varied  while  all  other  material-centered  variables  are 
held  constant,  will  there  be  a differential  efficiency  of  learning?." 


CHAPTER  II:  PROCEDURES 


Sample 

Eighty-four  students  were  randomly  selected  from  the  10th  grade  and 
84  from  the  11th  grade  at  Nova  High  School  (Ft.  Lauderdale,  Florida).1 
Forty-two  of  the  students  in  each  grade  are  male  and  42  are  female.  No 
attempt  was  made  to  representatively  sample  the  student  body  according  to 
any  a priori  selection  of  variables.  However,  each  home  room  (AdCom  room) 
was  sampled  in  proportion  to  its  relative  size.  According  to  school  board 
policy,  parents  were  notified  by  mail  of  the  project  and  consequently  had 
an  opportunity  to  withdraw  their  children. 

Nova  High  School  is  an  experimental  school  in  Broward  County,  Florida. 
The  students  are  predominately  from  middle  class  homes  in  the  Ft.  Lauderdale 
area.  This  school  was  selected  primarily  because  the  independent  study 
programs  and  modular  scheduling  permitted  the  needed  flexibility  to  have 
the  students  participate  in  the  study. 

It  was  found  at  the  time  of  the  study  that  one  tenth  grade-girl  who 
had  been  selected  was  not  actually  enrolled  at  the  high  school.  In  addi- 
tion, a boy  who  was  reported  by  his  mother  to  be  a non-reader  was  excused 
from  participating.  Of  the  remaining  166  students  only  98  completed  all 
parts  of  the  study.  The  reasons  for  the  attrition  are  presented  in 
Appendix  A.  The  resultant  composition  of  the  sample  and  the  effects  of 
the  high  attrition  rate  on  the  data  are  analyzed  and  discussed  later  in 
this  chapter. 

1The  author  is  indebted  to  Dr.  James  E.  Smith  and  Mr. David  Fitzpatrick 
for  allowing  the  students  to  participate  in  the  study,  and  to  the  entire 
staff  of  Nova  High  School  for  their  excellent  cooperation. 
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Materials 

A commercially  prepared  auto-instructional  program  on  objective 
psychology  was  selected  for  use  in  the  study  (Holland  and  Skinner,  1961). 2 
The  criteria  used  in  selecting  a program  were  (1)  that  the  subject  be 
one  with  which  high  school  students  have  a high  probability  of  being  un- 
familiar; (2)  that  the  length  be  such  as  to  provide  at  least  ten  hours  of 
instruction;  and  (3)  that  the  program  be  representative  of  the  principles 
of  the  Skinnerian  approach  (i.e.,  small  frames,  overt  responses,  and  linear 
sequencing).  The  last  criterion  was  included  because  only  programs  of 
this  type  can  be  readily  adapted  to  the  needs  of  the  study.  For  example, 
if  the  program  was  branching,  it  would  be  extremely  difficult  to  combine 
frames  to  make  new  frames.  Further,  branching  programs  generally  have 
longer  frames  and  multiple-choice  responses. 

Nine  hundred  sixty  frames  were  selected  from  the  first  24  sets  of 
the  program  for  use  in  the  study.  Ninty-six  of  these  frames  were  used 
as  introductory  material  to  familiarize  the  students  with  the  mode  of 
instruction  and  were  not  used  for  collecting  data.  The  remaining  864 
frames  were  used  for  the  study  and  provided  the  informational  base 
against  which  data  were  collected.  The  introductory  material  was  pre- 
sented in  a booklet  separate  from  the  other  material.  These  booklets 
were  designated  Part  I and  Part  II  respectively.  In  addition  to  frames. 
Part  I contains  one  "Exhibit"  (i.e.,  graphs,  tables,  pictures,  explana- 
tory materials,  etc.)  and  Part  II  contains  13  "Exhibits.'* 

2The  program  was  used  with  the  permission  of  the  publisher, 
McGraw-Hill,  Inc. 
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The  program  was  rewritten  into  seven  versions  with  all  variables 
except  frame  size  per  response  held  constant.  These  versions  are: 

(1)  Small  frames  as  originally  written. 

(2)  Material  of  two  of  the  smaller  frames  combined. 

(3)  Material  of  four  of  the  smaller  frames  combined. 

(A)  Material  of  eight  of  the  smaller  frames  combined. 

(5)  Material  of  16  of  the  smaller  frames  combined. 

(6)  Material  of  32  of  the  smaller  frames  combined. 

(7)  Material  of  all  of  the  smaller  frames  combined  so  that  no 
responding  is  required. 

For  illustration,  the  following  are  two  adjacent  original  frames: 

A light  flashed  into  the  eye  elicts 
constriction  of  the  pupil.  This 
sequence  is  called  the  pupillary . 

In  the  pupillary  reflex,  a flash  of 
light  is  said  to  the  response. 

These  two  frames  are  retained  verbatim  in  version  1 of  the  material. 

In  version  2 they  would  be  modified  as  follows: 

A light  flashed  into  the  eye  elicts 
constriction  of  the  pupil.  This  sequence 
is  called  the  pupillary  reflex. 

In  the  pupillary  reflex,  a flash  of 
light  is  said  to the  response. 

The  only  change  is  that  the  response  "reflex'"  is  filled  in  for  the 

first  original  frame.  Consequently,  only  one  response  is  now  required 

for  the  two  frames  combined.  The  original  spacing  and  format  is  maintained 

so  that  the  only  change  is  the  frame  size  or  frequency  of  responding.  In 


the  subsequent  versions,  responses  only  occur  for  the  last  original  frame 
in  each  of  the  composite  frames.  Thus,  each  successive  version,  with  the 
exception  of  version  7,  requires  only  half  as  many  responses  as  the  pre- 
ceding version.  Version  7 requires  no  responses  at  all  and  is,  there- 
fore, qualitatively  different  than  the  other  versions. 

Examples  of  each  of  the  seven  versions  are  presented  in  Appendix  B. 
All  of  the  versions  contain  an  equal  number  of  the  original  frames  (i.e., 
the  same  amount  of  instructional  material) . 

Before  the  preparation  of  the  different  versions,  the  material  was 
edited  so  that  all  the  frames  required  only  a single  response  instead 
of  multiple  responses  as  many  did  originally.  The  booklets  were  then 
prepared  by  arranging  eight  of  the  original  frames  on  a page.  Response 
blanks  were  filled-in  with  the  correct  answers  or  left  blank  according 
to  the  version  as  outlined  above.  Several  frames  were  arbitrarily 
eliminated  so  that  the  total  was  evenly  divisible  by  the  32  frames  in 
version  6.  In  one  case,  a non-technical  word  which  would  probably  have 
been  unfamiliar  to  the  students  in  the  sample  was  changed  to  a more 
familiar  word.  Several  of  the  original  frames  in  the  commercial  program 
required  the  student  to  select  a multiple-choice  answer.  These  frames 
were  rewritten  to  require  a structured  response  by  the  student. 
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Design 

The  sample  of  168  students  was  randomly  divided  into  seven 
groups  of  24  students  each,  with  sex  equally  divided  in  each  group. 
Each  group  worked  through  one  of  the  versions  of  the  material  as 
outlined  above  under  supervised  classroom  conditions.  The  students 
were  unable  to  remove  the  materials  from  the  classroom  or  to  use 
them  except  at  the  designated  time  each  day.  Prior  to  beginning  on 
the  programs,  the  separate  introductory  material  (Part  I)  for  that 
particular  version  of  the  material  was  used  to  familiarize  the  stu- 
dent with  that  version.  No  data  from  the  familiarization  period 
were  collected. 

In  order  to  minimize  and  counterbalance  extraneous  sources  of 
variation,  the  original  168  students  were  physically  divided  into 
six  groups  of  28  students  each.  Four  students  within  each  of  these 
physical  groupings  working  with  each  of  the  seven  material  versions, 
so  that  the  students  in  each  version  were  distributed  equally  across 
the  physical  groupings.  Each  of  the  six  groups  was  scheduled  to  meet 
for  64  minutes  a day  for  the  duration  of  the  project  (about  two  and 
one-half  weeks).  Each  of  the  groups  met  at  different  times  each  day, 
rotating  systematically  through  six  blocks  of  time.  Thus,  all  groups 
were  given  the  opportunity  of  working  at  varying  times  throughout  the 
day;  no  group  had  to  work  continually  at  a disadvantageous  time,  and 
no  student  had  to  miss  the  same  regular  class  on  successive  days. 

Because  the  project  was  superimposed  upon  the  regular  school 
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activities,  students  were  given  the  option  of  coining  at  different 
times  than  those  for  which  they  were  scheduled.  Students  frequently 
selected  this  option, which  resulted  in  further  scrambling  of  the 
experimental  groups.  The  result  was  that  disturbances  such  as  an 
electrical  power  failure  which  did  occur  during  the  study  did  not 
adversely  affect  any  one  experimental  group  alone,  but  affected 
some  membersof  all  the  experimental  groups. 

It  was  not  feasible  to  obtain  a single  monitor  to  supervise 
the  students  for  the  duration  of  the  study.  Instead,  a group  of 
female  adult  volunteers  was  employed  as  monitors.  There  were 
five  regular  volunteers  who  rotated  days  and  five  substitute  volun- 
teers. Initially  the  monitors  worked  in  pairs,  and  then  later 
alone.  Each  of  the  students  was  exposed  to  all  of  the  regular 
monitors  and  most  of  them  to  one  or  more  of  the  substitutes. 

Data  Sources 

Four  sources  of  data  were  collected  for  the  study.  These  are: 

(1)  An  achievement  test  was  given  to  each  student  the  day 
after  he  completed  the  materials  to  determine  his  know- 
ledge of  the  material.  The  development  of  this  test  is 
described  in  Chapter  III  and  a copy  of  it  appears  in 
Appendix  C. 

(2)  Completion  time  was  recorded  for  each  student  to  the 
nearest  five  minutes.  This  is  the  time  which  the  student 
had  the  material,  and  not  necessarily  the  actual  time  he 
worked.  Occasionally  times  were  not  entered;  in  those 
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cases  the  time  the  student  was  scheduled  was  used.  In 
general,  this  will  inflate  the  time  figure. 

(3)  The  number  of  response  errors  made  by  each  student  in 
working  through  the  materials  was  recorded  daily  by 
the  students  themselves.  While  this  method  allows 
room  for  students  to  falsify  the  records,  it  has  been 
used  with  apparent  satisfaction  by  other  investigators 
(e.g.,  Kapel,  1965). 

(4)  A brief  questionnaire  was  completed  by  the  students 
immediately  upon  completion  of  the  materials  to  deter- 
mine their  previous  experience  with  psychology  and 
their  reaction  to  the  materials . The  questionnaire  is 
discussed  in  Chapter  III  and  a copy  of  it  appears  in 
Appendix  C. 

(5)  A followup  questionnaire  was  sent  to  all  of  the  stu- 
dents after  completion  of  the  study  to  obtain  informa- 
tion about  the  reasons  for  attrition  and  about  the 
"Students1'"  opinions  on  some  aspects  of  the  study.  The 
questionnaire  is  presented  in  Appendix  C. 

Additionally,  SCAT  scores  were  available  in  the  school  files 
for  most  of  the  students  as  an  independent  variable.  The  SCAT  was 
given  statewide  in  the  ninth  grade  as  part  of  a test  battery.  In 
general,  when  the  SCAT  scores  are  not  available  for  a student,  it 
means  that  he  has  transferred  into  Florida  schools  from  out  of  state 
after  the  time  that  the  SCAT  was  given  for  his  class.  Since  the 
present  sample  consisted  of  tenth  and  eleventh  grade  students,  the 
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SCAT  scores  were  one  and  two  years  old  for  these  students.  Conse- 
quently, the  SCAT  scores  may  not  adequately  represent  a measure  of 
the  students  present  intellectual  capabilities.  This  fact  should 
be  kept  in  mind  in  interpreting  the  results. 

Unfortunately,  the  scores  obtained  had  been  converted  to  per- 
centiles based  upon  Florida  norms  and  the  raw  scores  were  not  avail- 
able. In  order  to  eliminate  the  underlying  rectangular  distribution, 
the  percentiles  were  converted  to  normalized  T scores  (mean  of  50, 
standard  deviation  of  10) . This  conversion  did  not  necessarily 
normalize  the  sample  used  in  the  study,  but  only  normalized  the  under- 
lying distribution  of  the  Florida  norms.  These  transformed  scores 
were  used  for  all  analyses  involving  the  SCAT. 

Attrition 

Tne  high  rate  of  attrition  reported  earlier  in  this  chapter  gives 
cause  for  concern  about  the  representiveness  of  the  students  remain- 
ing in  the  study.  Consequently,  considerable  effort  has  been  expended 
to  ascertain  the  reasons  for  the  attrition  rate  and  to  determine  what 
effect  it  may  have  had  on  the  data.  Questionnaires  were  sent  to  all 
students  to  find  out  why  they  did  not  finish  (See  Appendix  A).  Table  1 
reports  the  completion  status  of  the  166  students  who  were  to  have 
participated  in  the  study  (excluding  the  two  students  discussed  pre- 
viously). As  can  be  seen  from  this  table,  a total  of  15  students  did 
not  start  the  study  at  all.  The  reasons  for  this  failure  to  partici- 
pate which  were  given  by  those  students  who  responded  to  the  question- 
naire are  presented  in  Appendix  A.  Unfortunately  many  of  the  students 
also  declined  to  respond  to  the  questionnaire.  When  permission  was 
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Table  1 


Completion 

Status  of  166 

Students 

Who 

Were  Selected  for 

the  Study3 

Version 

Did  Not  Participated  But 

Start  Did  Not  Finish 

Finished 
Parts  of 

All 

Study 

Total 

1 

5 

8 

10 

23 

2 

3 

9 

12 

24 

3 

2 

6 

16 

24 

4 

1 

9 

14 

24 

5 

1 

6 

16 

23 

6 

1 

11 

12 

24 

7 

2 

4 

18 

24 

Total 

15 

53 

98 

166 

al68  students  were  originally  selected  for  the  study.  One 
girl  was  not  enrolled  in  school  and  one  boy  was  excused  because 
he  was  an  aphasic  non-reader. 

granted  for  the  study  by  the  school  administration  officials,  it  was 
understood  that  the  students  must  have  the  option  of  not  participating. 
Consequently,  no  effort  was  made  to  force  these  15  students  to  partici- 
pate, and  if  such  an  effort  had  been  made,  it  could  have  been  detri- 
mental to  the  study  if  the  students  already  held  or  formed  negative 
attitudes  about  participating.  There  is  no  reason  to  believe  that  the 
failure  of  these  15  students  to  participate  was  not  random  in  regards 
to  the  treatment  groups.  Since  they  did  not  participate  at  all,  they 
did  not  know  that  different  treatment  groups  existed,  and,  of  course, 
did  not  know  to  which  one  they  were  assigned. 


Of  the  151  students  who  began  the  study,  only  98  or  almost  65%  com- 
pleted the  study  through  taking  the  achievement  test.  Seventeen  or 
approximately  11%  of  the  students  finished  reading  the  material,  but 
did  not  return  to  take  the  test.3  Six  students  or  about  4%  quit 
participating  after  only  one  or  two  sessions.  The  remaining  21% 

(31  students)  participated  most  or  all  of  the  time,  but  did  not 
complete  the  study.  These  three  groups  of  students  total  53  and 
are  combined  in  column  3 of  Table  1.  The  reasons  for  not  finishing 
that  are  given  by  the  students  among  these  53  who  responded  to  the 
followup  questionnaire  are  listed  in  Appendix  A. 

The  high  rate  of  attrition  was  in  large  part  attributable  to 
the  timing  of  the  study.  Because  of  local  school  system  problems, 
the  starting  date  of  the  study  was  postponed  one  week  from  the 
originally  scheduled  date.  This  shift  in  the  starting  date  caused 
the  third  week  of  the  project  to  coincide  with  the  final  week  of 
the  high  school's  grading  period.  Consequently,  many  students  who 
had  not  yet  completed  the  study  had  to  withdraw  to  complete  their 
regular  course  requirements. 

A x2  was  caluclated  for  the  data  in  columns  3 and  4 of  Table  1 
to  determine  the  randomness  of  attrition  across  versions.  The  stu- 
dents in  column  2 who  did  not  start  the  study  were  omitted  in  part 
because  the  expected  frequencies  in  all  of  the  cells  in  that  column 
are  below  5.  Further,  however,  it  is  assumed  that  the  failure  of 

frequently  stated  reasons  for  failure  to  return  for  the  test 
are  absence  from  school  and  interference  with  other  school  work. 

See  Appendix  A for  other  stated  reasons. 
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these  students  to  participate  is  random  across  versions  as  discussed 
above.  For  this  reason,  the  frequencies  in  the  second  column  were 
not  added  to  those  in  the  third  column  to  calculate  the  x2-  The 
obtained  x2  Ts  5.48  (df  = 6)  which  is  not  significant  (p  > .40). 
Hence,  it  is  assumed  that  the  students  not  completing  the  study  are 
numerically  randomly  distributed  across  versions. 

Of  the  students  who  completed  the  study,  28  were  tenth-grade 
boys,  29  were  tenth-grade  girls,  18  were  eleventh-grade  boys,  and 
23  were  eleventh-grade  girls.  Table  2 presents  these  data  and  reports 


Table  2 

Students  by  Sex  and  Grade  Level  Completing  the  Study 


Grade 

Sex 

Tenth 

Eleventh 

Total 

Male 

28 

18 

46 

Female 

29 

23 

52 

Total 

57 

41 

98 

Using  marginal 

totals:  x2  * 

« 24  ^ d « i « i ^ in  * b • 

Assuming  equal 

frequencies : 

X2  - 3.17,  d.f.  = 1,  n.s. 

two  non-significant  x2's-  A x2  was  calculated  with  the  expected  fre- 
quencies based  upon  the  marginal  totals  to  determine  if  a relationship 
exists  between  the  grade  level  and  the  sex  of  the  students  wuo  finished. 
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The  calculated  value  is  .24  (df  = 1)  which  is  not  significant (p  > .80). 
Since  sex  and  grade  level  were  originally  equally  represented,  a 
second  % was  calculated  using  equal  expected  frequencies  in  each  cell 
(24.5)  to  determine  if  either  grade  or  sex  was  unequally  affected  by 
the  attrition.  The  value  of  this  x2  is  3.17  and  also  is  not  signifi- 
cant (p  < .10). 

Similarly,  Table  3 presents  a contingency  table  showing  the  dis- 
tribution of  sex  across  versions  of  the  students  who  completed  the 
study.  A x2  was  calculated  using  the  expected  frequencies  which  are 


Table  3 

Students  by  Sex  and  Version  Completing  the  Study 


Sax 

Version 

Male 

Female 

Total 

x 

4 

6 

10 

2 

C 

.J 

7 

12 

3 

5 

7 

16 

4 

7 

7 

14 

5 

9 

7 

16 

6 

*3 

9 

12 

7 

5 

9 

18 

Total 

46 

52 

98 

Using  marginal  totals: 

2 

X " 

3.84;  d.f.  - 

6,  n.s. 

Assuming  equal  expectancies:  x2  * 7.14;  d.f.  = 6,  n.s. 
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based  upon  the  marginal  totals  to  determine  if  the  resultant  fre- 
quencies of  each  sex  are  distributed  evenly  across  versions.  The 
value  is  3.84  (df  = 6)  which  is  not  significant  (p  > .60).  A second 
X was  calculated  to  ascertain  if  the  resultant  frequencies  represent 
more  than  a chance  departure  from  the  originally  intended  equal  cell 
frequencies.  An  expected  frequency  of  seven  was  used  for  all  cells 
and  the  value  of  x^  is  7.14  (df  *»  6).  This  value  is  not  significant 
either  (p  > .30).  Consequently,  it  appears  that  the  attrition  rate 
is  reasonably  uniform  across  the  categories  of  sex,  grade  level,  and 
treatment  group  and  that  it  does  not  represent  any  trends  which  will 
adversely  influence  the  data. 

Because  one  of  the  problems  contributing  to  the  attrition  was  an 
apparent  lack  of  time  in  the  third  week  of  the  study,  many  of  the 
slower  reading  students  may  have  been  forced  out  of  the  study.  Also, 
students  of  lower  ability  would  be  more  likely  to  need  to  direct  all 
of  their  energies  to  their  regular  courses  than  would  students  of 
high  ability  (keeping  in  mind  of  course  that  the  correlation  between 
ability  and  high  school  achievement  is  far  from  perfect) . Since  stu- 
dents of  low  anility  are  frequently  slow  readers  it  would  seem  likely 
that  these  students  would  be  still  working  on  the  materials  in  the 
third  week  and  would  have  to  drop  from  the  study  when  pressed  for 
time.  Another  factor  influencing  the  speed  with  which  the  students  com- 
pleted the  material  is  that  each  version  requires  a different  number  of 
responses.  Students  assigned  to  versions  requiring  a nigh  frequency  of 
overt  responding  should  tend  to  take  more  time  to  complete  the  material 


than  students  who  are  assigned  to  materials  with  a low  frequency  of 
overt  responding,  because  the  responding  itself  will  take  more  time.4 
Thus,  it  would  seem  likely  the  high  frequency  of  response  groups  would 
suffer  most  from  the  attrition  problem  of  the  third  week  of  the  study. 
If  this  situation  did  occur  the  resulting  data  could  no  longer  be 
considered  random;  rather  an  interaction  would  be  present  between 
ability  (and/or  working  speed)  and  the  material  versions.  Fortunately, 
the  evidence  does  not  indicate  that  this  happened. 

A x was  calculated  for  the  marginal  frequencies  for  versions  in 
Table  3 and  a value  of  3.43  (df  = 6)  was  obtained.  This  x2  is  not 
significant  (p  > .70),  which  indicates  that  the  final  frequencies  for 
the  versions  are  probably  randomly  distributed.  Hence,  it  is  unlikely 
that  a significantly  greater  number  of  students  was  forced  out  of  one 
version  than  was  forced  out  of  another  by  the  third-week  time  squeeze. 

Even  though  the  numbers  of  students  may  be  random  across  versions, 
the  characteristics  of  the  students  such  as  ability  or  other  learner- 
centered  variables  could  be  disproportionately  distributed  among 
the  versions.  Consequently,  in  order  to  better  assess  the  students 
who  completed  the  study,  an  analysis  of  variance  was  carried  out  for 
total  SCAT  scores  in  a participation  by  version  interaction  design. 

Two  levels  of  participation — those  who  participated  but  did  not  com- 
plete the  materials  and  those  who  did  complete  the  materials — were 

4The  results  of  the  analysis  of  the  completion  time  data  are  pre- 
sented in  Chapter  IV.  The  actual  relationship  between  versions  and 
completion  time  appears  to  be  curvilinear  across  the  range  of  versions 
studied;  high  response  frequency  requires  high  completion  time,  a 
minimum  is  reached  around  version  5,  and  then  the  times  increase  again. 


-27- 


used  together  with  the  seven  versions.  The  15  students  who  did  not 
participate  at  all  were  excluded  because  it  is  assumed  (as  discussed 
above)  that  their  nonparticipation  was  random  across  versions.  It  was 
reasoned  that  if  the  interaction  is  significant  or  if  the  versions'  main 
effect  is  significant,  then  the  attrition  (or  the  original  selection  or 
both)  has  caused  ability  as  measured  by  SCAT  scores  to  be  unequally 
distributed  across  the  cells. 

SCAT  scores  were  not  available  for  all  of  the  students  as  discussed 
earlier  in  this  chapter.  The  cell  sizes  representing  the  available 
scores  are  listed  in  Table  4 together  with  the  means  and  standard  de- 
viations. The  summary  of  the  analysis  is  presented  in  Table  5.  Only 
the  main  effect  for  participation  is  significant,  which  implies  that 
the  main  result  of  the  attrition  in  regards  to  ability  was  to  cut  off 
the  bottom  of  the  distribution  relatively  evenly  across  versions. ^ 


'’The  variances  of  the  SCAT  scores  are  not  homogeneous.  Hartley's 
maximum  F-ratio  test  was  used,  and  since  the  cell  sizes  vary  the  table 
was  entered  with  the  maximum  degrees  of  freedom  which  should  inflate  the 
alpha  level  increasing  the  power  of  the  test.  The  F-max  value  is  31.9 
(k  = 14,  df  = 13)  which  is  significant  at  the  .01  level.  While  the  lack 
of  homogeneity  would  not  greatly  influence  the  F test  if  the  cell  sizes 
were  equal,  the  cell  sizes  are  not  equal.  In  general,  a lack  of  homo- 
geneity will  inflate  the  value  of  F and  will  produce  more  significant 
results  than  would  otherwise  be  obtained  (Cochran  and  Cox,  1957  ; Edwards, 
1960) . Since  the  percentile  scores  are  more  homogeneous  than  the  nor- 
malized scores  (F-max  value  for  percentile  scores  is  29.0),  an  analysis 
of  variance  using  percentile  scores  was  completed.  This  produced  a F 
value  of  11.5  (p  < .05)  for  participation  main  effects  and  reduced  F's 
for  versions  and  the  interaction.  A square  root  transformation  of  the 
percentile  scores  (y'  = v/X_ ) further  reduced  the  lack  of  homogeneity 
(F-max  = 20.3),  but  still  did  not  eliminate  it.  An  analysis  of  variance 
using  the  square  root  transformed  scores  produced  an  F value  of  6.72  for 
paticipation  main  effects  which  is  significant  at  the  5%  level.  The 
interaction  and  the  main  effect  for  versions  were  not  significant  and 
all  the  F values  using  the  transformed  scores  are  lower  than  those  ob- 
tained with  the  original  scores.  If  a transformation  was  found  which 
made  the  variances  homogeneous,  the  expected  effect  on  the  analysis 
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Table  4 


Means  and  Standard  Deviations3  of  the  Available  SCAT 
Scores  (Total)  of  the  Students  Who  Participated 


Version 

Not  Completed 

Completed 

Total 

N 

8 

8 

16 

1 

MEAN 

52.7 

56.2 

54.5 

SD 

12.0 

8.5 

10.7 

2 

N 

7 

10 

17 

MEAN 

51.4 

61.3 

57.2 

SD 

6.1 

7.5 

8.5 

N 

6 

12 

18 

*5 

MEAN 

56.0 

57.2 

56.8 

SD 

7.1 

7.3 

7.2 

4 

N 

S 

14 

' 22 

MEAN 

51.0 

61.0 

57.4 

SD 

10.4 

12.6 

12.8 

N 

5 

12 

17 

5 

MEAN 

48.4 

57.6 

54. S 

SD 

10.2 

8.0 

9.7 

N 

10 

10 

20 

6 

MEAN 

50.9 

58. C 

54.4 

SD 

10.9 

7.7 

10.1 

N 

3 

14  * 

17 

7 

MEAN 

58.0 

57.1 

57.2 

SD 

2.2 

8.3 

7.5 

N 

47 

80 

127 

Total 

MEAN 

52.1 

58.4 

56 . 1 

SD 

9 .9 

8.5 

10.4 

aStandard  deviations  reported  in  this  table  describe  the  sample 
rather  than  the  population  (i.e.,  the  sums  of  squares  were  divided  by 
n rather  than  n-1) . 


would  be  to  further  reduce  the  value  of  the  F’s.  Since  the  principal 
reason  for  the  analysis  was  to  ascertain  that  the  versions  and  interac- 
tion effects  are  not  significant,  and  since  this  has  been  done,  no  further 
attempts  were  made  to  transform  the  scores. 
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Table  5 

Analysis  of  Variance  Summary  Table: 
Participation  by  Versions  for  SCAT  Total  Scores3 


Source 

DF 

Sums  of  Squares 

Mean  Square 

F 

A (Participation) 

T 

-L 

1578.7 

1578.7 

13.1** 

B (Version) 

6 

606.3 

101.0 

.8 

AB 

6 

643.8 

107.3 

.9 

error 

113 

13632.1 

120.6 

An  unweighted-means  analysis  was  used  (Winer,  1962). 

While  this  is  an  unfortunate  occurrence,  it  is  not  as  serious  as  if 
ability  had  interacted  with  the  versions  or  if  one  or  two  versions 
had  significantly  different  levels  of  ability  from  the  other  group. 

The  tendency  of  losing  the  lower  ability  students  from  the  study  will 
require  a redefinition  of  the  population,  and  will  perhaps  limit  the 
variance  due  to  ability.  Further,  the  distributions  of  SCAT  scores  may 
be  positively  skewed  because  of  the  loss  of  more  low  ability  students 
than  high  ability  ones. 

Attrition  may  have  also  influenced  the  distribution  across  versions 
of  students  having  different  previous  experiences  with  psychology. 

While  no  pre-test  was  given,  the  psychology  questionnaire  contained 
ten  questions  regarding  the  student* s previous  experiences  with  psycholo- 
gists and  with  psychology.  While  all  of  these  questions  may  not  have  the 
same  intrinsic  weight  or  importance , it  was  assumed  that  they  did  for  the 
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purpose  of  the  following  analysis.  The  number  of  questions  marked 
"yes"  by  each  student  was  used  in  a single  classification  analysis  of 
variance  to  determine  if  the  versions  differed  in  the  students'  back- 
grounds. The  means  and  standard  deviations  are  reported  in  Table  6 
(F-max  = 3.8,  K = 7 , df  = 17;  n.s.  at  p = .05)  and  the  summary  of  the 


Table  6 

Means  and  Standard  Deviations  of  Affirmative 
Answers  to  Ten  Psychology  Background  Questions 


Versions 

N 

Means 

Standard  Deviations3 

1 

9 

2.1 

1.3 

2 

12 

2.1 

1.3 

3 

16 

2.8 

1.4 

4 

14 

2.4 

1.8 

5 

16 

2.5 

1.7 

6 

11 

2.2 

1.3 

7 

18 

2.3 

1.3 

Total 

96 

3.4 

1.4 

^Standard  deviations  reported  in  this  table  describe  the  sample 
rather  than  the  population  (i.e.,  the  sum  of  squares  were  divided  by 
n rather  than  n-1) . 


analysis  is  shown  in  Table  7.  As  can  be  seen  from  the  F value,  there 
was  a tendency  for  more  variation  to  be  present  within  the  versions 


than  between  them. 
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Table  7 


Analysis  of  Variance  Summary  Table: 

Frequency  of  Affirmative  Answers  to  Psychology  Background  Questions3 


Source 

DF 

Sums  of  Squares 

Mean  Squares 

F 

Between 

6 

5.5 

.92 

.41  n.s. 

Within 

89 

200.7 

2.26 

Total 

95 

206.2 

aQues tionnaires  are  not  available  for  2 students,  so  the  analysis 
is  based  on  the  data  of  96  respondents. 


In  general,  individual  questions  received  affirmative  answers  too 
infrequently  to  be  analyzed  statistically.  For  example,  the  question; 
"Have  you  ever  been  taught  in  school  anything  about  psychology  (besides 
the  materials  which  you  have  jusc  completed)?"  received  a total  of  5 
affirmative  answers  representing  3 versions. 

Although  the  attrition  rate  was  unduly  high,  there  is  no  evidence 
that  it  has  resulted  in  unrepresentative  treatment  groups,  except  for 
making  them  more  homogeneous  in  regards  to  SCAT  scores.  Thus,  while  a 
much  lower  rate  of  attrition  would  have  been  desirable,  there  are  enough 
students  who  have  completed  the  study  to  provide  meaningful  information 
about  the  programmed  materials. 

Unfortunately  tangible  data  are  not  available  on  student  personality 
factors  nor  on  their  interest  in  the  project.  Consequently,  no  compari- 
sons can  be  made  between  the  students  who  dropped  out  and  those  who  com- 
pleted the  study  on  affective  and  personality  variables.  Therefore,  the 
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possibility  does  exist  that  the  treatment  groups  are  biased  on  these 
dimensions.  If  interest  in  psychology  was  greater  in  some  groups 
than  in  others  it  would  completely  distort  the  results  of  the  study. 

While  direct  evidence  is  not  available,  responses  to  two  questions 
on  the  followup  questionnaire  (see  Chapter  III)  give  some  indication 
that  the  dropouts  do  not  differ  appreciably  from  those  who  completed. 

The  first  question  is,  "If  you  were  requested  to  participate  in  a similar 
study  sometime  in  the  future,  would  you?"  Twenty-one  students  who 
participated  but  did  not  finish  responded  to  this  question  (out  of  a 
possible  53)  and  63  students  who  finished  responded  (out  of  a possible 
98).  Of  the  students  who  did  not  finish,  9 responded  with  "Yes"  and  12 
responded  with  either  "No"  or  "Don’t  Know"  (these  categories  were  com- 
bined because  of  the  small  number  of  "No"  responses — 3) . Of  those  who 
finished,  33  responded  "Yes"  and  30  responded  "No"  or  "Don't  Know." 
Calculation  of  x2  gives  a value  of  .57  (df  = 1)  which  is  not  significant 
(p  > . 40)  . 

The  second  question  is,  "Do  you  think  that  a reward  should  be 
given  to  those  who  participate?"  Of  the  students  who  did  not  complete 
the  study,  16  answered  "Yes"  and  6 answered  "No"  while  of  those  who  did 
finish,  44  answered  "Yes"  and  19  answered  "No."  The  x2  for  these  fre- 
quencies is  .07  (df  = 1)  which  is  also  not  significant  (p  > .70). 

If  the  students  who  finished  differ  substantially  from  those  who 
did  not,  it  would  be  expected  that  their  attitudes  toward  future  parti- 
cipation in  research  studies  would  also  differ.  Similarly,  it  would 
be  expected  that  those  who  were  least  satisfied  with  the  study  would 
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be  those  who  would  want  a "reward."  If  that  is  the  case,  then  satis- 
faction is  not  significantly  different  between  the  dropouts  and  those 
who  finished.  However,  the  situation  is  complicated  by  the  fact  that 
questionnaires  are  not  available  for  all  of  either  group  and  that  those 
which  were  returned  cannot  be  considered  representative  of  those  which 
were  not  returned.  If  not  returning  questionnaires  is  indicative  of  a 
negative  attitude  toward  the  study,  then  there  is  significant  evidence 
that  those  who  finished  do  differ  in  attitude  from  those  who  did  not. 
Twenty-three  out  of  53  (43%)  who  participated  but  did  not  finish  re- 
turned questionnaires  while  63  out  of  98  (64%)  who  finished  returned 
questionnaires,  x2  for  returning  questionnaires  based  upon  these 
frequencies  is  significant  (x2  = 6.15;  df  = 1;  p < .05),  indicating 
that  proportionately  fewer  persons  returned  the  followup  question- 
naire among  those  who  did  not  finish  than  among  those  who  did.  How- 
ever, since  the  questionnaire  was  sent  following  the  study,  the  differ- 
ent rate  of  questionnaire  returns  could  be  due  to  either  preexisting 
attitudes  or  it  could  be  a direct  result  of  some  aspect  of  the  study. 

The  available  information  is  not  sufficient  to  make  this  discrimination. 

The  most  important  concern  with  attitudes  is  whether  or  not  they 
differ  from  group  to  group.  The  data  for  the  above  questions  are  pre- 
sented in  the  analysis  section  (Chapter  IV) ; expected  cell  frequencies 
are  too  small  to  allow  calculation  of  x2's*  but  no  trends  are  evident. 
While  there  is  variation  among  the  treatment  groups , the  variation  is 
different  for  different  questions.  The  numbers  returning  question- 
naires in  each  group  are  listed  in  Chapter  IV.  A x2  f°r  those  return- 
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Ing  questionnaires  versus  those  not  returning  them  across  versions 
gives  a value  of  8.0  (df  = 6;  p > .20).  While  this  non-significant 
value  indicates  that  the  return  of  questionnaires  is  fairly  even 
across  versions,  it  is  confounded  by  the  fact  that  13  returned  ques- 
tionnaires could  not  be  identified  according  to  treatment  group  and 
are  excluded  from  the  x^.  These  could  be  distributed  proportionately 
across  version  or  could  be  heavily  concentrated  in  a few  versions. 

Also,  the  problem  still  exists  that  the  reasons  for  not  returning  the 
questionnaire  can  be  a direct  result  of  the  study  or  can  be  unrelated 
to  the  study.  Consequently,  little  can  be  concluded  about  attitudes. 


( 
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CHAPTER  III:  INSTRUMENT  DEVELOPMENT 

Three  instruments  were  developed  for  use  in  the  present  study. 

One  of  these  was  a 64-item  achievement  test,  the  second  a 14-item 
psychological  questionnaire,"  and  the  third  a questionnaire  con- 
taining items  concerning  the  attrition  rate  and  overall  attitudes 
toward  the  project.  The  development  of  these  instruments  is  des- 
cribed below  and  the  final  versions  are  in  Appendix  C. 

An  achievement  test  was  developed  to  measure  the  effectiveness 
of  the  instructional  material.  The  test  is  identical  for  each  ver- 
sion of  the  material  and  is  in  a multiple-choice  format.  The  test 
covers  only  Part  II  of  the  materials  and  not  Part  I.  Rather  than  pre- 
pare a complete  table  of  specifications  for  the  material,  the  nature 
of  the  programmed  instructional  material  being  in  small  frames  per- 
mitted a random  sampling  of  material  to  be  tested.  This  method  was 
certainly  more  efficient  than  preparing  a table  of  specifications  and 
it  probably  more  validly  represents  the  material.  In  order  to  equally 
represent  each  version  of  the  material,  108  of  the  original  frames  were 
randomly  selected  from  Part  II  of  the  material  — one  from  each  page. 

This  represents  1/8  of  the  total  number  of  frames.  Then  a multiple- 
choice  item  was  written  to  reflect  the  content  of  each  of  the  selected 
frames . 

The  completed  questions  were  then  submitted  to  two  judges  who  cri- 
tically read  them  to  detect  ambiguities,  inaccurate  statements,  and  poor 
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distracters.  Where  possible,  the  judges  suggested  methods  of  improving 
weak  questions. 

Since  the  test  had  to  be  given  within  a sixty-four-minute  period, 
only  64  questions  are  included  in  the  final  version.  The  64  questions 
represent  7.4%  of  the  total  possible  frames  in  Part  II.  These  questions 
are  allocated  in  such  a way  that  they  represent  also  7.4%  of  the  response 
frames  for  each  version  of  the  material  in  which  responding  is  required. 
Table  8 illustrates  the  numerical  allocation  of  these  questions. 


Table  8 

Frequency  of  Test  Frames  Corresponding  to 
Response  Frames  in  Each  Version 


No.  of  Frames 

Total  No. 

Test  Frames 
Coinciding 
with  Response 
Frames 
Before  Item 
Elimination 

Test  Frames 
Coinciding 
with  Response 
Frames 
After  Item 
Elimination 

Version 

per 

Response 

of  Response 
Frames 

Numb  er 

Percentage 

Number  Percentage 

1 

1 

864 

64 

7.4 

55  6.4 

2 

2 

432 

32 

7.4 

28  6.5 

3 

4 

216 

16 

7.4 

13  6.0 

4 

8 

108 

8 

7.4 

6 5.6 

5 

16 

54 

4 

7.4 

3 5.6 

6 

32 

27 

2 

7.4 

1 3.7 

NO  RESPONDING  REQUIRED 


7 
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Additionally,  each  of  the  eight  paginal  frame  positions  is  equally 
represented  by  eight  test  items. 

The  selection  of  the  questions  in  the  final  version  was  based 
upon  the  following  criteria: 

-*- • Each  of  the  responding  versions  must  be  equally  represented 
as  described  above, 

2.  Poor  quality  questions  as  indicated  by  the  judges  were  not 
used. 

3.  Questions  were  physically  distributed  as  much  as  possible 
across  the  material  for  each  of  the  responding  frames. 

inhere  these  criteria  were  not  adequate  to  choose  between  two  or  more 
questions,  the  choice  was  made  randomly. 

The  resultant  64  questions  were  then  randomly  ordered  for  the 
final  version  to  eliminate  the  existence  of  any  temporal  ordering  ef- 
fect m i_erms  of  the  amount  of  time  which  had  passed  since  that  part 
of  material  was  studied.  The  students  recorded  their  answers  to  the 
questions  on  mark— sense  cards  which  were  electrically  scored. 

After  administration  of  the  test,  the  questions  were  subjected  to 
an  item  analysis  so  that  weak  and  nondis criminatory  items  could  be  eli- 
minated. A difficulty  index  was  calculated  for  each  item  using  the 
scores  of  all  of  the  students.  Then  a discrimination  index  was  calcu- 
iated  ior  each  item  by  subtracting  the  number  of  correct  responses  in 
tne  lower  27 L or  the  students  from  the  number  of  correct  responses  in 
che  upper  27/,  of  the  students,  and  then  dividing  this  difference  by  the 
maximum  possible  difference.  These  two  indices  are  reported  for  each 
item  in  Table  9. 


While  there  are  many  item  analysis  techniques  which  are  more  so- 
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Table  9 


Item  Analysis  of  the  Psychology  Achievement  Test3 


Item 

Difficulty*5 

Index 

Discrimination0 

Index 

Item 

Difficulty*5 

Index 

Q 

Discrimination 

Index 

1 

57 

.44 

33 

76 

.33 

2 

85 

.41 

34 

42 

.30 

3 

88 

.33 

35 

77 

.44 

4 

64 

.59 

36 

6] 

.61 

• 5 

69 

.48 

37 

74 

.48 

6 

67 

.67 

38 

36 

.41 

7 

25 

.33 

39 

64 

.63 

8 

44 

.74 

40 

58 

.63 

*9 

92 

.18 

41 

63 

.33 

10 

57 

.59 

*42 

10 

0 

11 

42 

.52 

43: 

65 

.52 

12 

82 

.44 

44 

68 

.48 

13 

52 

.59 

45 

82 

.33 

14 

80 

.48 

*46 

94 

.07 

15 

86 

.37 

47 

74 

.70 

16 

28 

.59 

48 

67 

.74 

17 

71 

.52 

49 

54 

.74 

18 

75 

.44 

50 

40 

.59 

19 

77 

.30 

51 

77 

.67 

20 

61 

.59 

52 

71 

.41 

21 

54 

.78 

53 

68 

.67 

22 

80 

.30 

54 

68 

.59 

*23 

88 

.18 

*55 

65 

.26 

24 

82 

.33 

56 

59 

.67 

25 

71 

.37 

*57 

97 

.15 

*26 

89 

.26 

*58 

93 

.22 

27 

40 

.48 

59 

61 

.41 

28 

54 

.85 

60 

79 

.52 

29 

56 

.78 

61 

79 

.44 

30 

35 

.37 

62 

56 

.70 

31 

58 

.56 

63 

36 

.41 

32 

63 

.63 

*64 

93 

.22 

Based  on  scores  of  98  students;  standard  deviation  (before  item  elimi- 
nation) is  11.7. 

^Difficulty  index  is  percentage  of  all  students  getting  item  correct, 
discrimination  index  is  found  by  where  U is  the  number  of  students 
in  the  upper  27%  getting  the  item  correct,  L is  the  number  of  students 
in  lower  27%  getting  item  correct,  and  N is  27%  of  the  students. 

* Indicates  items  with  a discrimination  index  less  than  .30.  These  items 
were  eliminated  from  the  final  test  version. 
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A discrimination  index  of  .30  or  above  was  selected  as  the  cut- 

2 

off  criterion  for  the  retention  of  test  items. 

The  difficulty  index  was  not  used  to  eliminate  items  because  the 
extremely  easy  and  the  extremely  difficult  items  contribute  very  little 
to  determining  the  composition  of  the  top  and  bottom  27%.  Further,  the 
discrimination  index  removes  the  items  of  high  and  low  difficulty  as 
well  as  the  other  items  which  do  not  discriminate.  Using  the  discrimi- 
nation index  criterion,  nine  items  were  eliminated.  Each  of  these  items 
is  marked  with  an  asterick  in  Table  9.  After  removal  of  the  nine  items, 
the  membership  of  the  top  and  bottom  27%  changed  by  only  one  student , 
which  indicates  that  there  is  no  need  to  recalculate  the  discrimination 
index  based  on  the  revised  scores. 

The  effect  of  eliminating  these  items  on  the  representative  sampling 
of  frames  is  shown  in  Table  8.  Student  scores  were  then  calculated  on 
the  basis  of  the  remaining  55  items  and  used  in  the  analysis  reported 
in  Chapter  IV. 


phisticated  than  the  discrimination  index,  there  is  evidence  that  the 
use  of  a discrimination  index  is  quite  effective.  Engelhart  (1965) 
empirically  compared  discrimination  indices  based  on  the  upper  and  low- 
er thirds  and  on  a median  split  with  tetrachoric,  phi,  biserial  and 
point-biserial  correlations  and  with  the  Davis  method.  The  discrimina- 
tion indices  were  extremely  effective  and  gave  a better  indication  of 
the  actual  number  of  discriminations  than  did  the  correlational  tech- 
niques. 


2 % 

Ebel  (1965)  recommends  that  items  with  indices  of  discrimination  between 
.20  and  .29  are  "marginal  items,  usually  needing  and  being  subject  to  im- 
provement, and  that  items  with  indices  below  .19  are  "poor  items  to  be 
rejected  or  improved  by  revision."  Five  items  fall  in  the  former  category 
and  four  in  the  latter. 
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Since  the  test  was  administered  only  once,  test-retest  reliability 
could  not  be  calculated.  Instead,  internal  consistency  was  determined 
using  Kuder-Richardson  formula  20,  as  follows: 

r = _n_  st2  l Pjqi 
n-1  St2 

where : 

n is  the  number  of  items, 

Sfc2  is  the  variance  of  the  test  scores, 

is  the  proportion  of  correct  responses  on  Item  i 
and  q^  is  the  proportion  of  incorrect  responses  on  Item  i. 

Using  this  formula,  the  reliability  was  found  to  be  .94..  This  coeffi- 
cient is  considered  to  be  more  than  adequate  for  the  test  to  be  con- 
sidered reliable. 

No  attempt  was  made  to  determine  criterion-related  validity  of 
the  instrument,  since  it  is  an  achievement  test  with  high  apparent  con- 
tent validity.  Each  question  was  written  to  reflect  the  content  of  a 
specific  randomly  selected  frame.  Since  the  internal  consistency  was 
relatively  high,  it  is  reasonable  to  assume  that  the  test  tends  to  uni- 
formly measure  the  content  of  the  programmed  materials,  which  implies 
high  content  validity. 

The  second  instrument  used  in  the  study  was  a "psychology  question- 
naire," which  was  designed  to  serve  two  purposes.  Since  a pretest  was 
not  given  prior  to  the  study  in  order  to  avoid  practice  effects  and  to 
avoid  establishing  a set  for  the  material  to  follow,  a crude  attempt  was 
made  in  this  questionnaire  to  determine  if  the  students  had  any  prior 
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acquaintance  with  psychology.  Thus,  10  of  the  questions  on  the  student 
questionnaire  reflect  this  end.  The  remaining  four  questions  attempt  to 
assess  the  students'  reaction  to  the  material.  No  validity  or  reliability 
data  are  available  for  any  of  these  questions  and  they  should  be  regarded 
only  as  an  indication  of  the  students'  reactions.  Hence,  they  should  be 
interpreted  cautiously.  The  items  were  worded  in  question  form  and  the 
students  responded  by  circling  "Yes"  or  "No"  to  each  question. 

The  rationale  for  using  "Yes-No"  questions  (i.e.,  binary  choice) 
rather  than  some  other  type  as  5-point  Likert  scales,  is  based  in  part 
upon  the  nature  of  some  of  the  questions.  For  example,  the  question,  "Is 
your  father  or  mother  a psychologist?"  presents  only  binary  alternatives. 
Other  items,  however,  could  be  used  with  a larger  number  of  rating  points. 
If  the  questionnaire  was  composed  of  these  two  types  of  items,  it  would 
be  difficult  if  not  impossible  to  create  a meaningful  composite  of  the 
items.  There  is  no  justification  for  summing  scales  with  differing  num- 
bers of  points. 

For  the  items  dealing  with  attitudes  toward  the  material,  scales 
utilizing  more  than  two  points  could  be  used.  The  variance  would  be  in- 
creased and  finer  discriminations  could  be  made  among  individuals  than 
with  the  two  point  scale.  However,  the  Broward  County  School  Board  has 
adopted  a strong  position  against  psychological  and  attitudinal  testing. 
Part  of  the  testing  policy  adopted  in  June,  1967  reads  as  follows:  "No 
tests  other  than  standardized  intelligence,  aptitude  and  achievement  tests 
shall  be  scheduled  or  administered  without  the  express  consent  of  the 
Board."  The  policy  also  provides  that  if  a test  is  approved,  parents 
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must  be  notified  by  mail  on  School  Board  letterhead  of  the  testing  and 
must  be  given  the  opportunity  to  inspect  the  instrument  prior  to  the  test- 
ing. Further,  it  gives  the  parents  the  right  to  remove  their  children 
from  the  testing.  Faced  with  these  restrictions,  it  was  decided  that  a 
simple  "Yes-No"  Questionnaire  could  be  administered  without  objections, 
whereas  a sophisticated- looking  rating  scale  could  not.  Thus,  both  the 
limited  number  of  items  and  the  binary  response  mode  reflect  the  local 
ban  on  psychological  tests. 

These  same  considerations  are  also  reflected  in  the  third  instrument 
which  has  a "Yes-No"  format.  The  primary  purpose  of  the  third  in- 
strument was  to  ascertain  the  reasons  for  attrition.  (The  attrition 
questions  and  the  results  are  listed  in  Appendix  A.)  The  other  questions 
are  largely  based  on  concerns  raised  in  conversations  with  the  students 
in  the  study  and  on  indirect  feedback  from  the  students.  These  question- 
naires were  given  to  the  students  after  the  completion  of  the  study  and 
were  returned  to  the  investigator  by  mail.  In  an  attempt  to  secure  a 
good  rate  of  return  and  to  facilitate  frankness,  the  students  had  the 
option  of  not  putting  their  names  on  them.  Consequently,  some  of  the 
questionnaires  cannot  be  identified  and  they  are  excluded  from  analyses 
where  identification  is  necessary  (e.  g.,  comparisons  across  versions). 


CHAPTER  IV:  DATA  ANALYSIS 

Although,  there  seems  to  be  no  difference  among  the  final 
samples  on  the  various  independent  variables  (see  Chapter  II) , the 
unexpectedly  high  attrition  rate  for  the  subjects  nevertheless 
has  limited  the  possible  analyses  and  interpretations  because  of 
the  reduced  n's.  For  example,  interaction  designs  which  can  be 
employed,  must  be  limited  to  two  way  designs.  Also,  of  course, 
the  low  n’s  reduce  the  stability  of  the  estimated  parameters  used 
in  the  analyses.  Further,  since  the  resultant  cell  sizes  are 
unequal,  homogeneity  of  variance  and  normal  error  distribution 
cannot  be  assumed. 

Criterion  Test  Scores 

j.able  10  presents  the  resultant  sample  sizes  by  material  ver- 
sions and  by  sex  along  with  the  means  and  standard  deviations  for 
the  psychology  achievement  test.  As  can  be  seen,  the  n's  vary  from 
a low  of  3 to  a maximum  of  9.  An  inspection  of  the  standard  devia- 
tions especially  when  broken  down  by  sex  — strongly  suggests  that 
the  samples  do  not  have  homogeneous  variances.  If  the  sample  sizes 
were  equal,  a lack  of  homogeneity  would  not  seriously  affect  the  F- 
test  in  an  analysis  of  variance.  However,  since  the  sample  sizes 
do  vary  greatly,  Bartlett's  test  of  homogeneity  was  used.  The  cal- 
culated value  of  x2  by  this  test  is  7.05  which  is  not  significant 
(d.f.  ••  13,  p > .80).  This  indicates  that  homogeneity  of  the 
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Table  10 


Means  and  Standard  Deviations3  of  the  Psychology  Achievement  Test 

by  Version  and  Sex 


Version 

Male 

Female 

Total 

N 

4 

6 

10 

1 

MEAN 

41.0 

35.3 

37.6 

SD 

12.0 

13.0 

12.9 

N 

5 

7 

12 

2 

MEAN 

35.8 

38.9 

37.6 

SD 

8.8 

9.4 

9.3 

N 

9 

7 

16 

3 

MEAN 

30.3 

34.0 

31.9 

SD 

9.3 

10.3 

9.9 

N 

7 

7 

14 

4 

MEAN 

32.7 

33.4 

33.1 

SD 

7.8 

14.0 

11.3 

N 

9 

7 

16 

5 

MEAN 

29.2 

35.0 

31.8 

SD 

15.1 

8.5 

12.9 

6 

N 

3 

9 

12 

MEAN 

25.7 

40.2 

36.6 

SD 

16.4 

8.2 

12.5 

N 

9 

9 

18 

7 

MEAN 

31.7 

35.8 

33.7 

SD 

8.9 

12.3 

10.9 

Total 

N 

46 

52 

98 

MEAN 

32.0 

36.2 

34.2 

SD 

11.7 

11.2 

11.6 

aStandard  deviations 

reported  in 

this  table 

describe  the  sample 

rather  than 

the  population 

(i. e. , the 

sums  of  squares  were  divided 

by  n rather 

than  n-1) . 

variances  is  probably  present. 

A second  problem,  also  due  to  the  attrition  rate,  is  that  nor- 
mality cannot  necessarily  be  assumed  for  the  samples.  The  F-test  is 
not  greatly  affected  by  non-normal  distributions  when  the  sample 
sizes  are  relatively  large,  but  unfortunately,  the  sample  sizes  are 
rather  small.  However,  tests  of  homogeneity  of  variance  such  as 
the  Bartlett  test  used  are  usually  extremely  sensitive  to  departures 
from  normality  (Hays,  1963).  Hence,  since  Bartlett's  test  produced 
a non-significant  x2  it  is  reasonable  to  assume  that  the  distribu- 
tions approximate  normality  and  that  they  will  not  greatly  affect 
the  F-test.  Still,  in  drawing  conclusions  from  the  data,  it  must  be 
recognized  that  samples  with  small  n's,  especially  those  of  3 or  4, 
cannot  closely  represent  normal  distributions. 

Since  the  assumptions  of  the  F-test  appear  to  have  been  met,  a 
two-way  interaction  (sex  by  version)  analysis  of  variance  was  com- 
pleted for  the  psychology  achievement  test  scores.  The  summary  table 
for  this  analysis  is  presented  in  Table  11.  As  can  be  seen  from  the 
table,  neither  the  main  effects  nor  the  interaction  are  significant. 

Tn  fact,  for  the  versions'  main  effect  and  for  the  interaction  term, 
the  mean  squares  are  smaller  than  the  error  term.  This  phenomenon  in- 
dicates that  the  variance  due  to  the  differences  among  the  individuals 
tended  to  be  greater  than  the  variance  due  to  the  effects  of  having 
different  versions  of  the  material.  In  other  words,  even  with  the 
variance  due  to  sex  removed,  the  seven  versions  of  the  material  did 
not  cause  differential  performance  on  the  achievement  test. 
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Table  11 

Analysis  of  Variance  Summary  Table 
Sex  X Versions  for  Psychology  Achievement  Test  Scores 


Source 

DF 

Sums  of  Squares3  Mean  Square 

F 

A (sex) 

1 

308.4 

308.4 

2.17 

N.S 

B (version) 

6 

467.9 

78. G 

.55 

N.S. 

AB 

6 

687.0 

114.5 

.81 

N.S. 

error 

84 

11933.3 

142.1 

aAn  unweighted-means  analysis  was  used  (Winer,  1962). 


A possible  factor  Influencing  the  criterion  test  scores  could  be 
ability  if  it  is  differentially  distributed  across  versions.  Table  5 
in  Chapter  II  presents  a summary  of  an  interaction  analysis  of  SCAT 
total  scores,  in  which  no  difference  was  found  for  the  versions*  main 
effect  or  tor  the  interaction  of  versions  with  the  participation  vari- 
able. Thus,  it  would  be  unlikely  that  the  SCAT  scores  are  differen- 
tially distributed  across  the  versions. 

Nevertheless,  as  a further  check,  an  analysis  of  covariance 
was  made  using  the  SCAT  total  scores  as  the  covariate.  The  Pearson 
correlation  of  the  criterion  test  scores  with  the  SCAT  total  test 
scores  for  the  80  students  for  whom  SCAT  scores  were  available  is 
.58.*  For  an  N of  80,  this  value  is  significant  beyond  the  .001 

* The  correlation  of  the  criterion  test  scores  with  the  SCAT  verbal 
scores  is  .62  and  with  the  SCAT  quantitative  test  scores  is  .35.  Since 
the  correlation  coefficients  of  the  verbal  and  of  the  total  SCAT  scores 
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level,  indicating  a reasonably  strong  relationship  between  the  SCAT 
and  the  psychology  achievement  test  scores  (the  coefficient  of  deter- 
mination is  .34). 

The  relationship  suggests  that  an  analysis  of  covariance  using 
the  SCAT  scores  as  the  covariate  might  be  appropriate  and  might  permit 
the  effects  of  different  versions  to  be  detected  after  the  effect  of 
intelligence  (as  measured  by  the  SCAT)  has  been  statistically  removed. 
The  analysis  of  covariance,  then,  will  statistically  equalize  the 
group  on  this  variable  if  they  do  differ. 

Although  the  factor  of  sex  was  not  significant  in  the  above 
analysis  of  variance  and  the  point  biserial  correlation  of  sex  with  the 
SCAT  total  scores  is  .04,  sex  could  still  interact  with  intelligence  to 
influence  the  achievement  test  scores.  However,  reference  to  the  Table 
12  will  show  that  the  resultant  n for  males  in  version  6 is  2,  and  that 
four  other  cells  have  n's  of  4.  Consequently,  if  the  analysis  of 
covariance  was  computed  with  sex  as  a factor  in  an  interaction  design, 
some  of  the  regression  lines  would  be  based  on  extremely  small  and 
probably  highly  unreliable  samples.  Sex,  then,  is  ignored  as  a factor, 
and  the  analysis  of  covariance  is  completed  only  for  the  versions  in  a 


are  essentially  identical,  the  verbal  scores  could  probably  be  used 
interchangeably  with  the  total  score  in  the  analysis  of  covariance. 

The  total  was  chosen  to  use  because  it  can  be  regarded  as  a measure 
of  general  intelligence,  correlating  with  the  WAIS  total  at  about  .84 
(Green,  1965). 

The  internal  consistency  reliability  of  the  criterion  test  was 
reported  in  Chapter  III  to  be  .94  and  the  internal  consistency  of  the 
SCAT  is  .95  (Green,  1965).  Consequently,  because  of  these  high  relia- 
bilities, correcting  the  correlation  coefficient  of  .58  for  attenuation 
yields  a coefficient  of  only  .61. 


Table  12 


The  Number  of  Students  by  Sex  and  Version  for  Whom  Both  the 
Achievement  Test  Scores  and  SCAT  Scores  Were  Available. 


Version 

Male 

Sex 

Female 

Total 

1 

4 

4 

8 

2 

4 

6 

10 

3 

8 

4 

12 

4 

7 

7 

14 

5 

7 

5 

12 

6 

2 

8 

10 

7 

7 

7 

14 

Total 

39 

41 

80 

single  classification  design. 

Table  13  gives  the  means  and  standard  deviations  of  the  SCAT 
total  and  of  the  criterion  test  for  the  samples  used  in  the  analysis 
of  covariance.  Hartley's  maximum  F - ratio  was  used  to  test  for  homo- 
geneity of  variance.2  For  the  SCAT  the  ratio  of  the  largest  to  the 


2Hartley's  test  assumes  that  the  degrees  of  freedom  are  equal  for 
all  cells  which  is  not  true  for  the  data  in  this  case.  Consequently, 
in  determining  the  critical  value  of  F-max,  the  largest  number  of  de- 
grees of  freedom,  13,  was  used  to  enter  the  table.  This  procedure 
should  have  the  effect  of  increasing  the  size  of  alpha  beyond  the  .05 
level,  and  thus,  increasing  the  likelihood  of  rejecting  the  null  hypo- 
thesis of  equal  variances. 
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Table  13 

Means  and  Standard  Deviations3 


of 

the  SCAT  and 

the  Achievement 

Test 

Version 

N 

SCAT 

Achievement 

Mean 

SD 

Mean 

SD 

1 

8 

56.2 

8.5 

35.5 

13.5 

2 

10 

61.3 

7.5 

38.8 

9 . 6 

3 

12 

57.2 

7.3 

32.9 

9.5 

4 

14 

61.0 

12 . 6 

33.1 

11.3 

5 

12 

57.6 

8.0 

35.2 

12.7 

6 

10 

58.0 

7.7 

38.8 

10.2 

7 

14 

57.1 

8.3 

34.4 

8.7 

Total 

80 

58.4 

8.9 

35.3 

10.8 

Standard  deviations  reported  in  this  table  describe  the  sample 
rather  than  the  population  (i.e.,  the  sum  of  squares  were  divided  by  r, 
rather  than  n-1) . 
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smallest  variance  is  3. 0 (K  = 7 , df  = 13)  which  is  not  significant 
(p  > .05).  For  the  criterion  test,  the  ratio  is  2.4  (K  = 7,  df  = 13) 
which  also  is  not  significant  (p  > .05).  Consequently,  it  is  assumed 
that  the  variances  are  homogeneous . ^ 

A test  was  made  for  homogeneity  of  within  sample  regression  and 
the  calculated  F is  .97  (df  = 6,  66),  which,  of  course,  is  not  signifi- 
cant. Consequently,  it  is  assumed  that  the  regression  is  homogeneous 
and  the  treatment  means  were  tested  for  differences.  The  summary  of 
the  analysis  of  covariance  is  presented  in  Table  14.  The  calculated 


Table  14 

Analysis  of  Covariance  for  the  Psychology  Achievement 


Test 

Scores 

with  SCAT  Scores  as 

the  Covariate. 

Source 

DF 

Sums  of  Squares 

Mean  Square  F 

Treatments 

6 

453.6 

75.6  .91  N.S. 

Error 

72 

5983.8 

83.1 

Total 

78 

6437.4 

value  of  F is  .91 

. which 

is  not  significant. 

Thus,  it  seems  that  the 

SCAT  scores  are  equally  distributed  across  versions,  since  removing 
their  effects  has  no  significant  influence  on  the  treatment  means. 


^Footnote  5 in  Chapter  II  stated  that  the  variances  of  the  SCAT 
scores  were  not  homogeneous.  However,  in  that  test  the  smallest  and 
largest  variances  were  both  in  the  non-completion  groups.  These  groups 
were,  of  course,  not  included  in  the  analysis  of  covariance. 

Earlier  in  this  chapter,  it  was  reported  that  Bartlett's  test  failed 
to  detect  heterogeneity  of  variance  for  the  criterion  test.  A second  test 
was  performed  here  because  only  80  subjects  are  used  for  the  analysis  of 
covariance  instead  of  all  98. 
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Compietion  Time 

Completion  times  to  work  through  the  programmed  materials  were 
recorded  in  minutes  for  each  student  to  the  nearest  five  minutes.  These 
times,  however,  should  be  considered  as  a rough  estimate  because  of 
errors  in  the  recording.  For  example,  one  day  the  monitor  failed  to 
record  the  times  for  the  majority  of  the  students.  These  times  were 
subsequently  estimated  from  the  students*  own  records  and  from  the 
times  the  students  were  originally  scheduled  to  work  with  the  project. 

The  means  and  standard  deviations  for  each  of  the  versions  are 
presented  in  Table  15.  The  F-max  statistic  was  used  again  to  test 

Table  15 


Means  and  Standard  Deviations  of  Completion  Times  in  Minutes 


Version 

N 

Mean 

Standard  Deviation3 

1 

10 

347.0 

51.3 

2 

12 

306.2 

60.8 

3 

16 

281.6 

54.9 

4 

14 

240.4 

63.7 

5 

16 

231.9 

64.2 

6 

12 

260.0 

48.0 

7 

18 

276.1 

70.9 

Total 

93 

aStandard  deviations  reported  in  this  table  describe  the  sample 
rather  than  the  population  (i.e.9  the  sum  of  squares  were  divided  by 
n rather  than  n-1) . 
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for  homogeneity  of  variance.  The  calculated  value  is  1.9  (K  = 7, 
df  - 17)  which  is  not  significant  (p  > .05).  An  analysis  of  variance 
was  then  performed  on  the  data  and  the  results  are  reported  in  Table  16. 


Table  16 

Analysis  of  Variance  for  Completion  Time 


Source 

DF 

Sums  of  Squares 

Mean  Square 

F 

Between 

6 

105515.6 

17585.9 

4.31** 

Within 

91 

371096.9 

4077.9 

Total 

97 

476612.5 

**  Indicates  significance  at  p < .01 


The  F value  of  4.31  (df  = 6,  91)  is  highly  significant  (p  < .01). 
This  finding  was  anticipated  because  it  was  assumed  that  it  would  take 
longer  to  complete  material  which  required  a lot  of  overt  responding 
than  it  would  to  complete  with  little  or  no  overt  responding.  However, 
inspection  of  the  distribution  of  means  does  not  permit  that  simple  an 
explanation.  The  means  decrease  from  a high  for  version  1 to  a low 
for  version  5,  and  then  increase  again.  In  order  to  determine  which 
differences  among  the  means  are  statistically  significant,  a compari- 
son among  the  means  was  made  using  the  test 
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where 

X and  X are  the  adjusted  treatment  means, 
12 
2 

EX  is  the  total  sum  of  squares  for  X and 

O 

the  within  sum  of  squares  for  X.1* 


Table  17 

Differences  Between  Completion  Time  Means 


Version 

1 

2 3 

4 

5 

6 

7 

Means  N 10 

12  16 

14 

16 

12 

18 

*** 

*** 

** 

1 

347.0 

40.8  65.4* 

100.1 

108.9 

87.0 

70.9  * 

2 

306.2 

24.6 

59.3* 

68.1* 

46.2 

30.1 

3 

281.6 

- 

34.7 

43.5 

21.6 

5.5 

4 

246.9 

- 

8.8 

13.1 

29.2 

5 

238.1 

- 

21.9 

38.0 

6 

260.0 

- 

16.1 

7 

276.1 

- 

* Indicates 

significance 

at  p < 

.05 

**  Indicates 

significance 

at  p < 

.01 

***  Indicates 

significance 

at  p < 

.001 

The  results  of  the  comparisons  are  presented  in  Table  17,  The 
differences  between  the  mean  for  version  5 and  those  for  version  6 and 
7 are  not  significant.  However,  the  finding  of  increasing  completion 


This  test  statistic  is  discussed  in  Winer  (1962) , Cochran  and 
Cox  (1957),  Snedecor  (1956),  and  other  books  dealing  with  multiple 
group  comparison. 
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time  after  Version  5 still  may  be  a true  function  of  the  versions 
rather  than  attributable  to  sampling  error. 

Descriptively,  the  correlation  ratio  nwill  yield  an  index  of 
the  strength  of  the  relationship  regardless  of  the  shape  of  that 
relationship.  Using  the  information  in  Table  16,  q is  calculated 
by  taking  the  square  root  of  the  ratio  of  the  between  sums  of  squares 
to  the  total  sums  of  squares.  This  value  is  .47. 

Pearson  product-moment  correlation  measures  the  strength  of 
the  linear  relationship  between  two  variables.  For  the  relationship 
of  versions  and  completion  time,  the  Pearson  coefficient  is  -.30 
(the  negative  sign  is  merely  a function  of  the  numbering  of  the  versions) 
The  test  of  the  null  hypothesis  that  q = r gives  an  F value  of  3.03 
(df  = 5,  91)  which  is  significant  at  the  .05  level^  This  implies 
that  the  relationship  between  versions  and  completion  time  is  best 
described  by  a curvilinear  function.  Unfortunately,  q's  main 
importance  is  as  a description  of  the  sample  and  not  as  an  estimate 
of  the  population  association  (Hays  , 1963) . The  calculation  of  q 
has  only  confirmed  the  obvious  fact  that  the  completion  time  is  not 
linearly  related  to  versions. 

The  estimation  of  the  population  relationship  and  the  determina- 
tion of  the  best  fitting  function  is  usually  done  by  trend  analysis. 
However,  trend  analysis  normally  assumes  equal  intervals  between  the 
levels  of  the  independent  variables.  Such  as  assumption  cannot  be 
made  with  the  material  versions  because  the  frame  size  doubles  for 
each  successive  version.  Because  the  relationship  of  successive  ver- 


5 F=  [(h2-r2)  / (n  - 2)]  /[(l  - q2)  / (N-n) ] 
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sions  is  known,  it  would  seem  reasonable  that  a corrective  function 
could  be  developed  a priori  to  equalize  the  intervals.  Thus,  if  the 
logical  assumption  of  a linear  relationship  prevailed,  some  equation 
such  as 

T = R + f (F) 

could  be  utilized,  where  T is  the  completion  time,  R is  the  reading 
time  and  f(F)  is  a function  of  the  number  of  response  frames  (F)  in 
the  material.  The  logical  assumption  would  be  that  f(F)  would  actually 
be  the  time,  t,  required  to  make  a response  multiplied  by  the  number  of 
responses  (i.e.,  tF) . Then  version  7 would  have  completion  time,  R, 
since  no  responding  is  required;  version  6 would  have  completion  time, 

R + 27t,  since  there  are  27  elicited  responses;  version  5's  completion 
time  would  be  R + 54t  and  so  on.  (Since  the  test  of  n = r indicates 
curvilinearity,  the  function,  f(F)  is  probably  not  as  simple  as  tF. 
Rather,  some  function  such  as  tFP  where  p is  an  unknown  exponent  would 
be  more  likely) . 

Unfortunately,  this  simple  relationship  does  not  work  because  the 
completion  for  versions  4,  5,  and  6 are  all  less  than  that  for  version  7. 
Thus,  if  the  model  is  true,  the  data  imply 

R > R + 27t  > R + 54t  > R + 108t 
and  R < R + 2l6t  < R + 432t  < R + 864t 

These  contradictory  relationships  are,  of  course,  not  possible.  Even 
without  the  contradictions,  the  first  set  of  inequalities  requires  t, 
the  time  to  respond,  to  be  a negative  quantity,  which  of  course,  is  also 
impossible. 

Consequently,  as  was  suggested  previously  by  Hi  the  simplest 


a priori  relationship  between  completion  time  and  versions  does  not 
fit  the  data,  and  a simple  correction  for  the  lack  of  equal  intervals 
is  not  available.  Whether  the  curved  nature  of  the  completion  time 
is  due  to  extraneous  sources  of  variation  such  as  sampling  errors  or 
to  the  effects  of  the  frame  sizes  cannot  be  discerned  from  the  pre- 
sent study.  All  that  can  be  concluded  from  the  comparisons  reported 
in  Table  17  is  that  the  versions  requiring  a high  frequency  of  respond- 
ing (i.e.,  versions  1 and  2)  do  have  significantly  longer  completion 
times  than  do  versions  requiring  lower  completion  times. 

Error  rate 

Each  student  kept  a record  of  the  numbers  of  errors  he  made  in 
working  through  Part  II  of  the  material.  Since  the  frequency  of  re- 
sponding varied  with  the  material  version,  the  error  rate  is  in  part 
a function  of  the  version.  The  raw  error  means  for  each  version  are 
reported  in  Table  18.  Inspection  of  the  table  shows  that  the  mean 
errors  decrease  as  the  number  of  responses  decrease  except  for  ver- 
sion 2 which  increases  slightly.  In  order  to  allow  for  the  different 
maximum  number  of  errors,  the  errors  were  converted  to  percentages 
of  responses  as  shown  in  Column  5 of  the  table.  Expressed  as  per- 
centages, the  means  increase  steadily  across  versions  to  a high  for 
version  6. 

In  order  to  better  understand  the  function  of  the  errors,  the 
correlations  of  both  the  raw  scores  and  the  percentage  errors  with  the 
criterion  test  were  calculated.  Between  raw  score  errors  and  the 
criterion  test,  the  Pearson  correlation  coefficient  is  -.16,  which 
is  not  significant  (df  = 96,  p>  .05).  Between  percentage  errors 
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Table  18 


Means  and 

Standard  Deviation3 

of  Errors 

Made  in 

the  Programmed 

Materials 

Version*3  Number  of 

Responses 

Raw  Error 
Mean  SD 

Percent 

Mean 

Error 

SD 

1 

864 

36.4 

22.5 

4.2 

1.1 

2 

432 

36.9 

33.9 

8.3 

7.9 

3 . 

216 

21.4 

18.8 

9.9 

8.9 

4 

108 

11.5 

8.5 

10.6 

7.8 

5 

54 

8.1 

4;  8 

15.0 

8.9 

6 

27 

4.2 

4.5 

15.6 

16.7 

aStandard  deviations  reported  in  this  table  describe  the  sample 
rather  than  the  population  (i.e.,  the  sum  of  squares  were  divided  by 
n rather  than  n-1.). 


^Version  7 is  omitted  since  no  responses  were  required. 

and  the  criterion  test,  the  correlation  is  -.36,  which  is  significant 
beyond  the  .001  level.  Additionally,  the  errors  were  converted  to 
T-scores  (mean  =50,  sd  = 10)  within  each  version,  and  the  correlation 
with  the  criterion  test  was  found  to  be  -.32,  which  is  also  significant 
(p  < .01).  These  correlations  indicate,  then,  that  a negative  relation- 
ship exist  between  the  errors  made  by  a student  and  his  score  on  the 
criterion  test.  However,  even  with  the  significant  correlations,  only 
13%  of  the  variance  of  the  criterion  test  is  in  conraon  with  the  per- 
centage errors  (only  10%  with  the  T-scores) . 

The  number  of  errors  made  by  an  individual  should  be  in  part  a 
function  of  his  mental  ability.  Thus,  at  least  some  of  the  relation- 
ship between  errors  and  the  criterion  test  may  be  due  to  intelligence. 
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The  correlation  between  percentage  of  errors  and  the  SCAT  total  scores 
is  -.25.  A correlation  coefficient  of  .58  between  SCAT  total  scores 
and  the  criterion  test  scores  was  reported  earlier  in  this  chapter. 
Removing  the  correlation  with  the  SCAT,  the  partial  correlation  between 
errors  (percentage)  and  the  criterion  test  is  - .27.  While  this  value 
is  still  significant  (p  < .01)  the  variance  held  in  common  is  reduced 
to  about  7%.  Considering  the  relatively  small  magnitude  of  the  relation- 
ship between  errors  and  the  criterion  test,  it  appears  that  error  rate 
alone  is  a poor  predictor  of  performance  on  the  criterion  test.  The 
intercorrelation  of  the  various  variables  is  discussed  further  in  the 
next  section. 

Intercorrelation  of  Dependent  and  Independent  Variables 

Intercorrelation  coefficients  for  the  criterion  test,  completion 
time,  and  percentage  errors  along  with  six  independent  variables  are 
presented  in  Table  19.  The  correlations  of  the  criterion  test  scores 
with  other  variables  are  particularly  interesting.  The  lowest  correla- 
tion (-.03)  is  with  versions  which  gives  further  indication  of  the  lack 
of  a relationship  between  the  test  and  the  frame  sizes.  On  the  other 
hand,  the  criterion  test  has  significant  correlations  with  the  three 
SCAT  scores,  with  sex,  and  with  percentage  errors.  These  coefficients 
suggest,  as  discussed  previously,  that  individual  differences  are  the  most 
important  determiners  of  the  test  scores. 

The  correlation  (-.36)  with  percentage  errors  is  difficult  to  ex- 
plain, since  percentage  errors  correlate  significantly  (.34)  with  ver- 
sions even  though  the  criterion  test  and  versions  have  essentially  no 
correlation.  If  a multiple  R was  being  calculated  with  the  criterion 
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Table  19 

Intercorrelations  of  the  Criterion  Variables 
With  Several  Independent  Variables3 


Criterion 

Variables 

Independent  Variables 

1 

2 

3 

4 

5 

6 7 

8 

9 

1.  Criterion 
Test 

.11 

(98) 

-.36** 

(80) 

.62** 

(80) 

.35** 

(80) 

.58**  -.23* 
(80)  (98) 

-.16 

(98) 

-.03 

(98) 

2.  Completion 
Time  (minutes) 

-.01 

(80) 

-.21 

(80) 

-.24* 

(80) 

-.21  -.02 
(80)  (98) 

-.12 

(98) 

-.30*’' 

(98) 

3.  X Errors^ 

-.28* 

(66) 

-.12 

(66) 

-.25  .07 

(66)  (80) 

.19 

(80) 

.34** 

(80) 

4.  SCAT  V 

.62** 

(138) 

.88**-. 05 
(138)  (138) 

.08 

(138) 

.11 

(138) 

5.  SCAT  Q 

.88**  .15 
(140)  (140) 

.00 

(140) 

.13 

(140) 

6.  SCAT  T 

.04 

(140) 

.02 

(140) 

.07 

(140) 

7.  Sex 

-.01 

(166) 

.03 

(166) 

8.  Grade 

.02 

(166) 

9.  Version0 

3The  n for  each  correlation  is  reported  in  ( ) below  each  coefficient 

bSince  nc  responding  was  elicited  in  version  7,  correlations  with  % errors 
are  based  only  on  versions  1 through  6. 

cVersions  was  quantified  for  the  calculation  of  the  correlation  coefficients 
by  utilizing  the  numbers  1 through  7 which  have  been  used  nominally  in  this 
study.  These  numbers  are  ordered  in  terms  of  frame  size,  but  they  do  not 
reflect  the  differences  in  frame  size.  The  total  number  of  responses  in  the 
material  or  the  number  of  original  frames  per  response  might  be  better  Indices 
of  the  versions  variable. 

* Indicates  significance  at  p < .05 

**  Indicates  significance  at  p < .01 
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test  as  the  predicted  variable,  versions  would  act  as  a suppressor 
variable  as  long  as  percentage  errors  was  in  the  equation  (but  the 
increase  in  R would  be  small  because  of  the  low  magnitudes  of  the 
correlations).  This  relationship  implies  that  the  version  influences 
the  error  rate,  but  that  this  influence  is  unrelated  to  the  criterion 
test.  When  the  variance  in  the  error  rate  (percentage  error)  due  to 
the  effect  of  versions  is  partialled  out,  the  relationship  of  error 
rate  to  the  criterion  test  is  increased.  (It  was  pointed  out  pre- 
viously that  atleast  some  of  the  relationship  between  error  rate 
and  the  criterion  test  is  due  to  mental  ability). 

The  intercorrelations  involving  the  three  SCAT  scores,  sex, 
grade,  and  versions  reflect  the  original  sampling  before  attrition. 

The  coefficients  for  sex,  grade  and  versions  differ  slightly  from 
zero  only  because  two  of  the  original  168  students  are  not  included 
and  because  of  rounding  errors . The  correlations  involving  the  SCAT 
scores  and  the  other  three  variables  are  based  upon  the  students  for 
whom  SCAT  scores  were  available.  These  coefficients  are  surprisingly 
low — especially  with  versions — since  the  SCAT  scores  were  not  taken 
into  consideration  in  sampling.  The  correlation  of  SCAT  quantitative 
with  sex  is  significant  (r  = .15,  p <.05)  and  the  correlation  with 
versions  approaches  significance  (r  = .13,  p < .10)  but  in  both  cases 
only  about  2%  of  the  variance  is  held  in  common.  These  figures,  however, 
cannot  be  interpreted  as  an  indication  of  the  representiveness  of  the 
students  who  completed  the  study;  they  only  have  implications  for  the 
original  sample. 

Another  correlation  coefficient  which  requires  comment  is  that 
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between  completion  time  and  percentage  error  rate  (-.01).  Percentage 
error  rate  is  reported  in  Table  19  instead  of  raw  error  rate  because  the 
raw  error  rate  strongly  reflects  the  different  response  frequencies  which 
distinguish  the  different  versions.  However,  use  of  the  percentage  error 
rate  may  cloud  the  true  relationship  between  error  rate  and  completion 
time.  Completion  time  should  be  a function  of  the  number  of  errors  made 
since  making  errors  should  be  time  consuming  (this  factor  was  omitted  in 
the  previous  discussion  of  completion  time  because  error  rate  had  not  yet 
been  discussed  and  because  time  consumed  by  errors  still  does  not  explain 
the  curved  nature  of  the  time  and  versions  relationship) . The  correlation 
of  raw  error  rate  with  completion  time  is  .39  which  is,  of  course,  signifi- 
cant. Part  of  this  relationship  can  be  explained  away  by  the  fact  that 
both  error  rate  and  completion  time  are  strongly  influenced  by  the  frame 
sizes.  The  correlation  of  raw  error  rate  with  versions  (quantified  by 
ranking  only)  is  -.54  and  the  correlation  of  completion  time  with  versions 
is  -.30.  Thus,  the  partial  correlation  of  raw  error  with  completion  time  is 
.29,  which  suggests  that  completion  time  is  a function  of  error  rate,  in 
addition  to  reading  time  and  number  of  responses. 

Based  upon  this  information,  the  a priori  equation  for  completion 
time  can  now  be  modified  to  include  error  rate.  Still  assuming  linearity, 
the  equation  becomes : 

T = R + tF  + eE 

where  e is  the  time  required  to  make  an  error  and  E is  the  number  of 
errors  made.  In  order  to  assess  the  contribution  of  e,  a system  of 
three  equations  were  set  up  using  the  mean  completion  times,  mean  error 
rates,  and  the  number  of  responses  for  versions  1,  2,  and  3.  The  resul- 
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tant  equations  are: 

347.0  = R + 864 t + 36. 4e 
306.2  = R + 432t  + 36. 9e 
281.6  = R + 216t  + 21. 4e 

Since  completion  time  is  essentially  linear  considering  only  versions  1,2, 

and  3,  these  equations  were  solved  simultaneously  giving  the  following  values 

R = 259.6  min. 
r = .10  min. 

e = .03  min. 

While  the  curvilinear  nature  of  the  relationship  across  all  versions  tends  to 
negate  the  premises  these  values  are  based  upon,  they  should,  nevertheless, 
give  an  indication  of  the  relative  magnitudes  of  the  three  time  factors.  As 
can  be  seen  from  the  value  for  e,  33  + errors  would  have  to  be  made  before 
a minute  is  added  to  the  completion  time.  Since  the  largest  mean  number  of 
errors  is  36.9,  error  rate  does  not  seem  to  be  a major  factor  in  completion 
time.  Further,  since  error  rate  is  not  independent  of  the  number  of  response 
elicitations,  there  is  little  to  be  gained  from  having  it  in  the  equation. 

Attitudes 

As  was  pointed  out  in  Chapter  III,  the  instruments  for  the  measurement 
of  attitudes  are  limited  in  scope. 

Four  questions  on  the  "Psychology  Questionnaire"  were  designed  to  ex- 
amine aspects  of  the  students'  attitudes  toward  the  programmed  material. 

Since  all  of  the  variables  except  frame  size  are  constant  for  all  versions, 
differences  in  responding  among  treatment  groups  should  reflect  the  influence 
of  frame  size.  Of  course,  previous  interests  in  psychology  and  other  possi- 
ble individual  differences  would  affect  the  responses  also. 


Table  20  reports  responses  to  the  question,  "Would  you  like  to  read 
more  about  psychology?"  The  question  assumes  that  if  the  material  is  enjoy- 
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Table  20 

Responses  to  the  Question: 

"Would  you  like  to  read  more  about  psychology?" 


Version 

Response 

1 

2 

3 

4 

5 

6 

7 

Total 

# 

YES 

5 

11 

11 

9 

9 

9 

17 

71 

% 

55.6 

91.7 

68.8 

64.3 

56.3 

81.8 

94.4 

71.9 

// 

4 

1 

5 

5 

7 

2 

1 

25 

NO 

% 

44.4 

8.3 

31.2 

35.7 

43.7 

18.2 

5.6 

28.1 

Total3 

9 

12 

16 

14 

16 

11 

18 

96 

aTotal  cell  frequencies  differ  slightly  from  those  previously 
stated  because  questionnaires  for  two  students  are  not  available. 


able,  the  student  will  want  to  read  more.  The  expected  frequencies  (based 
on  the  marginal  total)  in  seven  of  the  14  cells  in  Table  20'  are  less  than 
five  which  prevents  the  calculation  of  a x2  • The  percentages  of  "Yes" 
responses  vary  from  a low  of  55.6  for  version  1 to  a high  of  94.4  for  ver- 
sion 7;  however,  the  relationship  is  not  smooth  between  versions  1 and  7. 
If  version  1 is  ignored,  the  relationship  between  the  percentage  of  "Yes" 
responses  and  versions  is  curvilinear  going  from  an  initial  high  for  ver- 
sion 2 to  a low  for  version  5 and  then  back  up  for  versions  6 and  7.  Un- 
fortunately, little  can  be  concluded  from  the  data  in  Table  20  since  the 
three  highest  percentages  are  for  versions  2,  6,  and  7.  While  a case 
could  be  made  for  6 and  7 differing  from  the  other  versions,  it  is  diffi- 
cult to  explain  the  high  response  in  version  2. 
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Table  21 

Responses  to  the  Question: 

' Would  you  rather  read  a regular  textbook  on  psychology  than 
have  it  taught  by  the  materials  you  have  just  completed?" 


Version 

Response  1 

2 

3 

4 

5 

6 

7 

Total 

# 

4 

7 

6 

7 

8 

6 

11 

49 

YF.S 

% 

44.4 

63.6 

37.5 

50.0 

50.0 

66.7 

66.7 

53.3 

# 

5 

4 

10 

7 

8 

3 

6 

43 

NO 

% 

55.6 

36.4 

62.5 

50.0 

50.0 

33.3 

35.3 

46.3 

Total3 

9 

11 

16 

14 

16 

5 

17 

92 

aTotal  of  all  frequencies  differ  slightly  from  those  previously  stated 
Decause  questionnaires  for  two  students  are  not  available  and  because 
some  students  did  not  respond  to  this  question- 


Table  21  presents  the  data  for  the  question,  "Would  you  rather  read  a 
regular  test  book  on  psychology  than  have  it  taught  by  the  materials  you 
have  just  completed?"  Four  of  the  cells  (29%)  in  this  table  have  expected 
frequencies  below  5 which  prevents  the  use  of  x2  to  determine  if  non-chance 
relationships  exist.  As  with  the  previous  question,  there  is  no  clear  trend 
to  these  responses. 

Table  22  contains  the  responses  to  the  question,  "Did  you  like  the  way 
psychology  was  taught  in  the  materials  you  have  just  completed?"  Three  or 
about  21%  of  the  cells  in  this  table  have  expected  frequencies  below  5. 

This  figure  is  borderline  for  x2»  so  a x2  was  calculated.  The  obtained 
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Table  22 

Responses  to  the  Question: 

"Did  you  like  the  way  psychology  was  taught 
in  the  materials  you  have  just  completed?" 


Version 


Response 

1 

2 

3 

4 

5 

6 

7 

Total 

# 

YF^  

5 

3 

8 

8 

7 

4 

7 

42 

% 

55.6 

27.3 

50.0 

61.5 

44.7 

36.4 

38.8 

43.6 

# 

NO 

4 

8 

8 

5 

9 

7 

11 

52 

% 

44.4 

72.7 

50.0 

38.5 

55.3 

63.6 

61.2 

56.4 

Total3 

9 

11 

16 

13 

16 

11 

18 

94 

X2  = 4. 

02,  df 

= 6,  p < 

.40 

aTotal  of  all  frequencies  differ  slightly  from  those  previously  stated 
because  questionnaires  for  two  students  are  not  available  and  because 
some  students  did  not  respond  to  this  question. 


value  is  4.02  (df  = 6)  which  is  not  significant  (p  >.60).  No  trend  is  dis- 
cernable  in  the  percentages  of  responses  across  versions;  there  are  more 
reversals  here  than  in  either  of  the  two  previous  questions. 

Responses  to  the  question,  "Would  you  like  to  become  a psychologist?" 
are  reported  in  Table  23.  Half  of  the  cells  in  this  table  have  expected  fre- 
quencies of  less  than  5,  which  again  prevents  the  use  of  x2-  While  there  is 
no  apparent  trend  across  the  first  5 versions,  versions  6 and  7 have  a higher 
percentage  of  "Yes"  responses  than  do  the  others.  The  observed  frequencies 
for  versions  1 through  5 were  combined  and  those  for  6 and  7 were  combined 
and  a x2  was  calculated  for  the  resultant  2x2  contingency  table.  The 
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Table  23 

Responses  to  the  Question: 

"Would  you  like  to  become  a psychologist?" 


Version 

Response 

1 

2 

3 

4 

5 

6 

7 

Total 

it 

1 

1 

3 

1 

*5 

J 

4 

7 

20 

Yhb 

% 

11.1 

8.3 

18.8 

7.1 

18.8 

40.0 

38.9 

21.1 

a 

NO 

8 

11 

13 

13 

13 

6 

11 

75 

% 

88.9 

91.7 

81.2 

92.9 

81.2 

60.0 

61.1 

78.9 

Total3 

9 

12 

16 

14 

16 

10 

18 

95 

aTotal  of  all  frequencies  differ  slightly  from  those  previously  stated 
because  questionnaires  for  two  students  are  not  available  and  because 
some  students  did  not  respond  to  this  question. 

value  is  highly  significant  (x2  = 7.9,  df  = 1,  p < .005).  Thus,  persons  com- 
pleting versions  6 and  7 indicated  that  they  would  like  to  become  psycholo- 
gists significantly  more  frequently  then  those  in  the  other  versions.  Since 
no  pre-treatment  data  were  collected  on  this  question,  it  is  impossible  to 
ascertain  whether  this  finding  is  a function  of  the  versions  or  of  some  pre- 
existing interest  of  the  students  in  these  groups.  If  it  is  a function  of 
the  versions,  it  indicates  that  the  students  reading  materials  with  very 
few  and  with  no  responses  are  more  influenced  by  the  content  than  those 
who  have  more  responses.  This  interpretation  should  be  regarded  very  caut- 
iously. On  the  other  hand,  if  it  indicates  a preexistant  interest  in  psy- 
chology, it  would  suggest  that  the  groups  did  have  initial  differences  and 
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that  the  criterion  test  results  may  be  influenced  by  differential, 
preexisting  interests.  Which  of  these  interpretations  best  describes 
the  actual  situation  cannot  be  determined  until  the  study  is  replicated. 

The  questionnaire  which  was  sent  to  the  students  after  the  comple- 
tion of  the  study  contained  several  questions  which  provide  information 
about  the  student's  attitudes.  These  questions  are: 

(1)  If  you  were  requested  to  participate  in  a similar  study  sometime 
in  the  future,  would  you? 

(2)  Do  you  think  that  a reward  should  be  given  to  those  who  participate? 

(3)  Would  you  like  to  see  the  materials  which  you  read  become  part  of 
the  regular  course  work  at  Nova  High  School? 

(4)  Did  you  learn  anything  about  psychology  from  participating  in  the  study? 
Since  the  students  had  to  complete  the  questionnaires  and  mail  them  back, 
responses  are  available  for  only  63  of  the  98  students  who  completed  the 
study.  Additionally  13  of  these  are  anonymous  and  the  version  cannot  be 
identified.  Thus,  only  50  students  can  be  used  for  comparisons  among  the 
versions.  This  complicates  both  the  analysis  and  the  interpretation  of 

the  data  because  of  the  reduced  N and  because  of  the  difficulty  of  assum- 
ing that  the  questionnaires  which  were  returned  are  representative  of 
these  which  were  not. 

Table  24  presents  the  data  regarding  the  future  participation  study. 
There  are  too  many  cells  with  expected  frequency  below  5 to  calculate 
a x2-  No  particular  trend  emerges  from  the  data;  most  of  the  students 
would  be  willing  to  participate  again. 

Table  25  contain  the  responses  regarding  the  rewards.  Again  a x2 
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Table  24 

Responses  to  the  Question: 

"If  you  were  requested  to  participate  in  a similar 
study  sometime  in  the  future,  would  you?" 


Version 

Response 

1 

2 

3 

4 

5 

6 

7 

Total 

YES 

# 

4 

2 

7 

7 

5 

5 

6 

36 

% 

80.0 

50.0 

63.6 

77.8 

83.3 

62.5 

85.7 

72.0 

NO 

// 

0 

1 

2 

0 

0 

1 

0 

54 

% 

0.0 

25.0 

18.2 

0.0 

0.0 

12.5 

0.0 

28.0 

DON'T 

KNOW 

# 

1 

1 

2 

2 

1 

2 

1 

10 

% 

20.0 

25.0 

18.2 

22.2 

16.7 

25.0 

14.3 

20.0 

Total 

v 

5 

4 

11 

9 

6 

8 

7 

50 

cannot  be  calculated  and  no  clear  trend  is  detectable.  Table  26  gives  the 
responses  for  the  question  pertaining  to  the  materials  becoming  part  of 
regular  course.  A cannot  be  calculated  because  of  the  low  expected  fre- 
quencies and  no  relationship  to  versions  is  apparent.  The  last  question  is 
summarized  in  Table  27.  "Yes"  responses  are  very  high  for  every  group  indi- 
cating that  almost  all  the  students  responding  state  that  they  did  learn 
some  psychology  in  the  study. 

The  failure  to  produce  more  meaningful  results  regarding  attitudes  is 
primarily  due  to  the  limited  nature  of  the  instruments.  This  is  unfortunate 
and  should  be  corrected  in  future  studies. 
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Table  25 

Responses  to  the  Question: 

"Do  you  think  that  a reward  should 
be  given  to  those  who  participate?" 


Version 

Response 

1 

2 

3 

4 

5 

6 

7 

Total 

# 

YES 

3 

0 

6 

3 

3 

4 

5 

24 

% 

60.0 

0.0 

54.5 

33.3 

50.0 

50.0 

71.4 

48.0 

# 

2 

4 

5 

6 

3 

4 

2 

25 

MU  

% 

40.0 

100.0 

45.5 

67.3 

50.0 

50.0 

28.6 

52.0 

Total 

5 

4 

11 

9 

6 

8 

7 

50 

Table  26 

Responses  to 

the  Question: 

"Would  you  like  to  see  the 

materials  which 

you  read  become 

part 

of  the 

regular 

course 

work 

at  Nova  High  School?" 

Version 

Response 

1 

2 

3 

4 

5 

6 

7 

Total 

# 

5 

0 

7 

7 

4 

3 

C 

-/ 

31 

YjjS 

% 100.G 

0.0 

63.6 

77.8 

66.7 

'37.5 

71.4 

62.0 

if 

0 

4 

4 

2 

2 

5 

2 

15 

NO  - 

% 

0.0 

100.0 

36.4 

22.2 

33.3 

62.5 

18.6 

38.0 

Total 

5 

4 

11 

9 

6 

8 

50 
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Table  27 

Responses  to  the  Question: 

"Did  you  learn  anything  about  psychology 
from  participating  in  the  study?" 


Version 

Response 

1 

2 

3 

4 

5 

6 

7 

Total 

# 

5 

3 

9 

9 

5 

7 

6 

45 

I lib 

% 

100.0 

75.0 

90.0 

100.0 

83.3 

87.5 

85.7 

90.0 

# 

NO 

0 

1 

1 

0 

1 

1 

1 

5 

% 

0.0 

25.0 

10.0 

0.0 

16.7 

12.5 

14.3 

10.0 

Total 

5 

4 

10 

9 

6 

8 

7 

50 

CHAPTER  V:  DISCUSSION  AND  CONCLUSIONS 


The  most  important  outcome  of  the  study  was  the  failure  to 
obtain  statistical  significance  among  the  treatment  groups  on  the 
criterion  test.  This  suggests  that  frame  size  (i.e.,  the  frequency 
of  responding)  does  not  make  a difference  on  the  students'  mean  perfor- 
mance, and  that  individual  differences  are  the  most  important  deter- 
minant of  success,  holding  the  other  material  variables  constant.  If 
the  thesis  that  programmed  instruction  requires  overt  responding  at 
very  frequent  intervals  is  a valid  one,  then  there  should  have  been 
significant  differences  among  the  means  on  the  treatment  groups. 

It  must  be  remembered  that  this  particular  study  has  not  yet 
been  replicated  and,  therefore,  its  findings  must  be  regarded  as  ten- 
tative at  this  point.  While  the  first  assumption  is  that  frequency 
of  responding  makes  no  difference,  the  observed  phenomena  could  have 
other  explanations  such  as  the  frequently  ignored  Type  II  error  where 
an  invalid  null  hypothesis  is  not  rejected.  Further,  the  criterion 
test  may  not  be  adequately  measuring  the  students'  knowledge  or  it 
may  be  unable  to  differentiate  among  the  versions,  even  though  differ- 
ences may  exist.  Although  the  treatments  did  not  differ  on  the  inde- 
pendent variables  studied,  there  are,  of  course,  many  independent 
variables  which  were  not  studied.  It  is  possible  that  these  variables 
were  operative  in  such  a way  to  negate  differences  among  treatment  means. 
Further,  there  was  not  a test  for  long-term  retention,  consequently  the 
different  treatments  may  produce  differential  results  over  a period  of 
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time.  Of  course,  long-term  retention  is  usually  the  goal  of  classroom 
learning  and  if  one  treatment  did  produce  more  permanent  results  than 
another,  this  is  a more  important  finding  than  the  effects  on  immediate 
retention.  Consequently,  the  study  should  be  repeated  in  the  very  near 
future  with  different  samples  of  students  and  with  different  programmed 
materials.  If  the  findings  do  withstand  replication,  it  will  certainly 
be  a milestone  in  programmed  instruction.  In  that  case,  the  other  vari- 
ables, such  as  redundancy  and  discrete  steps,  must  be  analyzed  to  deter- 
mine what,  if  any,  the  important  variables  of  programmed  material  are. 

An  important  limitation  of  the  study  was  the  absence  of  contingencies 
on  the  outcome  of  the  criterion  test.  Rather,  responding  on  the  test  was 
a free  operant  situation  in  which  the  students  were  neither  extrinsically 
punished  nor  extrinsically  rewarded  for  their  performance.  The  existence 
of  a contingency  on  the  test  scores — such  as  money  for  a certain  obtained 
score  might  force  differences  in  the  student's  performance  among  the  ver- 
sions that  were  not  present  in  the  absence  of  a contingency.  Future  repli- 
cations of  the  study  should  manipulate  criterion  test  contingencies  to  de- 
termine if  they  will  produce  different  results. 

It  should  be  made  clear  that  the  failure  of  the  study  to  find  signifi- 
cant difference  among  group  means  on  the  criterion  test  does  not  imply  that 
a particular  individual  will  learn  equally  well  regardless  of  the  frame  size 
or  frequency  of  responding.  Programmed  learning  has  frequently  been  regarded 
as  a universal  panacea  which  will  benefit  all  people  equally  well.  This  is  > 

probably  a naive  position,  considering  all  we  know  about  the  differences 
among  individuals  and  the  little  we  know  about  how  humans  acquire  knowledge. 

It  is  difficult  to  conceive  of  a unique  media  or  method  of  instruction  work- 
ing equally  well  with  all  people  and  it  is  extremely  unlikely  that  such 
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a universal  method  will  ever  be  found,  unless  it  comes  from  fields  such 
as  psychopharmacology  or  psychoneurology.  Thus,  the  important  aspect  of 
the  present  study  was  not  to  cast  doubt  upon  the  necessity  for  short 
frames  with  high  response  frequency  per  se,  but  rather  to  cast  doubt  upon 
this  as  a universal  method.  It  cannot  be  determined  from  the  data  in  the 
present  study  whether  any  particular  individual  will  work  better  with  one 
method  than  he  does  with  another  method.  However,  this  is  certainly  a 
possibility  and  it  is  very  definitely  a topic  for  further  research.  While 
it  may  very  well  be  that  only  a few,  if  any,  individuals  work  at  their  op- 
timum level  with  a high  frequency  of  responding,  the  possibility  that  many 
individuals  work  best  with  a high  frequency  of  responding  is  not  contra- 
dicted by  the  study. 

One  of  the  potential  uses  of  programmed  instruction  is  to  aid  in  in- 
dividualizing instruction  where  each  person  is  enabled  to  learn  at  his  own 
optimal  level  within  the  limits  of  the  available  resources.  It  has  been 
recognized  that  the  conventional  classroom  containing  30  or  40  students, 
taught  by  a single  teacher,  is  not  necessarily  the  best  method  for  impart- 
ing knowledge.  However,  to  substitute  some  other  method  across  the  board 
to  all  students  would  be  equally  bad.  Instead,  researchers  need  to  pro- 
vide information  which  can  aid  in  the  establishment  of  guidelines  for  the 
optimal  implementation  of  instructional  materials.  At  present,  it  is  not 
feasible  to  study  every  individual  to  ascertain  the  method  by  which  he 
learns  best,  although  with  the  aid  of  computer-based  instruction  and  other 
forms  of  computer  assistance  in  the  instructional  process,  such  a procedure 
may  eventually  be  possible.  However,  until  that  time,  a feasible  path  for 
the  researcher  to  follow  is  to  identify  various  sub-groups  or  clusters  of 
individuals  who  have  in  common  the  characteristic  of  optimal  learning  in 
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the  same  general  way.  For  example,  one  group  might  be  identified  which 
learns  best  using  Crowder's  auto-tutors  for  physics,  while  another  group 
might  be  identified  which  learn  physics  best  by  the  lecture  method.  It 
is  assumed,  of  course,  that  the  variables  associated  with  the  methods  by 
which  each  group  learns  best  can  be  identified  and  measured.  Thus,  an 
important  line  of  research  to  follow  studies  such  as  the  present  one,  is 
to  determine  the  characteristics  of  groups,  if  any,  which  learn  best  by 
programmed  instructional  materials  of  each  of  the  versions  presented  here 
and  by  other  possible  versions. 

If  a particular  group  or  cluster  of  students  is  identified  who  does 
succeed  best  with  version  1 of  the  materials  as  presented  in  this  study 
using  psychology  as  a subject  matter,  then  other  subject  matters  should 
be  also  investigated  with  that  group  to  determine  if  version  1 works 
equally  well  regardless  of  content,  or  if  it  interacts  significantly 
with  the  content.  Then,  if  predictive  variables  can  be  identified  in 
that  group  or  cluster,  similar  groups  or  clusters  can  be  sampled  else- 
where and  the  studies  can  be  replicated  to  see  if  the  assumptions  hold. 

In  this  systematic  fashion,  researchers  can  proceed  to  establish  some 
basic  constructs  for  a theory  of  teaching  or  a theory  of  instruction. 

Thus,  the  present  study  opens  up  many  avenues  of  related  research; 
however,  the  first  and  most  important  line  of  research  is  the  replication 
of  the  present  study.  The  findings  of  no  study  can  be  considered  definite 
unless  they  have  been  replicated  several  times  without  contradictory  re- 
sults. After  the  replications,  and  if  the  findings  are  maintained,  it 
will  be  possible  to  move  into  other  related  areas,  such  as  those  discussed 


above . 
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It  would  appear  that  not  only  does  responding  or  not  responding 
fail  to  make  a difference  on  the  criterion  test,  but  that  responding 
requires  significantly  longer  completion  times  than  nonresponding. 

The  findings  on  completion  time  are  slightly  confusing,  because  the  re- 
lationship between  frame  size  and  completion  time  appears  to  be  curvi- 
linear. However,  this  relationship  could  be  merely  a phenomenon  unique 
to  this  particular  sample.  If,  however,  the  phenomenon  of  finding  the 
lowest  completion  time  to  be  for  version  5 (where  responses  were  elicited 
every  sixteenth  frame)  is  a valid  finding,  it  might  suggest  that  occa- 
sional responding  keeps  the  student  working  at  a faster  pace  than  either 
greater  or  less  responding.  This  might  be  because  the  responding  is  fre- 
quent enough  to  keep  his  attention,  but  is  not  so  frequent  that  it  be- 
comes unduly  monotonous.  However,  at  this  point,  this  is  only  specula- 
tion and  further  research  must  be  conducted  to  determine  if  the  curvi- 
linear relationship  holds  for  other  samples  and  if  it  does,  the  reasons 
why  it  occurs. 

The  data  of  this  study  do  indicate  that  a high  frequency  of  respond- 
ing requires  more  time  to  complete  the  materials  than  do  low  frequencies 
of  responding.  Every  version  of  the  material,  except  version  2,  required 
significantly  less  time  than  did  version  1.  Additionally,  3 of  the  ver- 
sions required  significantly  less  time  than  did  version  2.  Thus,  if  the 
frame  size  is  not  related  to  the  criterion  test  scores,  but  does  affect 
completion  time,  an  effective  argument  is  made  for  not  having  short  frames 
as  defined  in  the  present  study. 
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The  study  of  error  rate  revealed  little  except  that  error  rate  is 
a weak  predictor  of  the  criterion  test  used.  There  is  not  much  need 
to  study  error  rate  by  itself.  The  intrinsic  interest  in  error  rate 
manifest  in  other  types  of  learning  studies  is  not  present  here,  because 
the  acquisition  does  not  occur  over  repeated  trials.  The  object  of 
learning  continually  changes  throughout  the  materials  so  that  determina- 
tion of  the  change  in  error  rate  over  time  does  not  necessarily  imply 
the  level  of  learning.  A study  could  be  designed  to  investigate  the 
possible  relationships  of  changes  in  error  rate  to  other  criterion 
measures,  such  as  achievement  tests.  The  relationship,  if  it  exists, 
could  be  an  important  factor  in  designing  materials  to  optimize  learn- 
ing. Such  a study,  however,  could  be  best  carried  out  through  the  use 
of  computer-assisted  instruction  where  continuous  student  records  can 
be  maintained. 

The  failure  to  find  conclusive  differences  in  the  students'  attitudes 
is  not  surprising,  due  to  the  limited  nature  of  the  instruments.  As  was 
explained  previously,  the  inadequacies  of  the  questionnaires  are  largely 
due  to  local  restrictions  on  attitude  testing.  The  main  intent  of  the 
study  was  to  determine  the  effect  of  frame  size  on  cognitive  acquisition, 
which  was  done.  However,  the  affective  variables  should  not  be  ignored, 
since  they  may  be  more  important  than  the  cognitive  ones  in  the  long  run. 

The  significant  differences  which  were  found  are  not  clearly  due  to 
either  the  sampling  and  procedure  or  to  preexisting  attitudes.  In  the 
case  of  more  low-responding-group  students  wanting  to  be  psychologists 
than  those  in  the  higher-responding-groups , it  is  unlikely  that  the  ran- 
dom sampling  error  would  cause  that  concentration,  although  the  attrition 
conceivably  could  have  caused  it.  It  would  seem  most  likely  that  the 


-77- 


difference  is  due  to  treatments,  but  this  cannot  be  categorically  as- 
serted. 

On  the  other  hand,  the  finding  that  more  students  who  completed 
the  study  returned  questionnaires  than  those  who  did  not  complete  the 
study  is  probably  due  to  preexistent  factors  which  also  contributed  to 
the  attrition  itself.  That  is,  the  personality  variables  which  led  some 
people  to  drop  out  of  the  study,  most  likely  led  them  not  to  return  the 
questionnaire  either.  However,  this  cannot  be  stated  with  assurance. 

The  explanation  may  be  that  those  who  did  not  complete  the  study  were 
embarrassed  by  the  prospect  of  further  contact  with  the  study  and  ignored 
the  questionnaire  to  avoid  that  embarrassment.  Also,  there  could  be  other 
explanations,  all  of  which  are  speculation  with  little  supportive  evi- 
dence to  go  on. 

Another  consideration  in  measuring  attitudes  toward  new  media  and 
new  materials  in  an  experimental  situation  is  that  novelty  effects  and 
Hawthorne  effects  may  distort  the  measurement.  Confrontation  with  new 
materials  may  influence  attitudes  in  either  direction.  If  the  person 
feels  less  comfortable  with  the  new  materials  than  he  does  with  more  fa- 
miliar materials,  he  may  degrade  them.  However,  if  he  has  had  a chance 
to  become  accustomed  to  the  new  materials,  he  may  rate  them  higher.  On 
the  other  hand,  the  new  materials  may  fascinate  him  because  they  are 
different  and  he  may  race  them  higher  than  if  he  were  more  accustomed  co 
them.  The  Hawthorne  effect,  of  course,  would  inflate  the  racings  across 
all  experimental  groups.  Thus,  differences  in  attitude  may,  in  part,  be 
due  to  these  influences  of  the  situations. 

In  summary,  this  study  has  presented  evidence  that  the  Skinnerian 
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tenet  of  small-sized  frames,  each  with  its  own  elicitation  of  an  overt 
response  may  not  be  valid,  since  no  difference  could  be  statistically 
detected  among  the  seven  treatment  groups.  Since  this  finding  could  be 
due  to  extraneous  factors  influencing  the  data,  it  must  be  replicated  - 
as  all  studies  should  be.  Further,  while  the  study  suggests  that  stu- 
dents in  general  do  not  differ  across  versions,  it  does  not  imply  that 
a particular  individual  will  not  do  better  with  one  method  than  with 
another.  The  completion  time  data  suggest  that  occasional  responding 
can  decrease  the  time  required  to  complete  the  materials.  This  finding 
also  must  be  replicated. 


APPENDIX  A:  REASONS  FOR  ATTRITION 


A questionnaire  was  sent  to  all  of  the  students  who  had  been 
selected  for  the  study  and  who  did  not  finish  to  determine  why  they  did 
not  finish.  The  questionnaire  listed  6 different  reasons  for  non-comple- 
tion, plus  an  "other"  reason.  The  reasons  listed  on  the  questionnaire  are: 

(1)  I was  absent  from  school  too  much  of  the  time. 

(2)  Regular  school  work  kept  me  too  busy. 

(3)  My  parents  did  not  want  me  to  participate. 

(4)  I felt  that  the  study  was  a waste  of  my  time. 

(5)  I just  did  not  want  to  participate. 

(6)  I wanted  to  finish,  but  the  study  was  discontinued  before  I 
was  done . 

Reasons  1,  2,  4 and  6 were  included  because  of  specific  feedback  from  the 
students  in  the  study.  Reason  3 was  included  because  many  of  the  children 
whose  parents  called  the  researcher  after  they  were  notified  of  the  study 
did  not  participate.  Reason  5 was  added  to  detect  if  some  students  felt 
uneasy  about  research  participation. 

Table  28  presents  the  results  of  the  questionnaires.  No  one  checked 
reasons  3 and  5 so  these  have  been  excluded  from  the  table.  In  some  cases, 
students  checked  multiple  reasons  so  that  the  sums  of  the  frequencies  exceed 
the  number  of  students.  Since  all  of  the  students  did  not  return  the  question- 
naire, the  frequencies  listed  cannot  be  regarded  as  descriptive  of  ail  the 
non-participants.  In  fact,  the  people  who  did  not  return  the  questionnaire 
should  probably  be  considered  to  have  different  reasons  for  not  participating 
from  those  who  did  return  the  questionnaire. 
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Table  28 

Reasons  Given  by  Students  Who  Did  Not  Complete  the  Study 


Completion  Status 

Participation 

No 

Partici- 

pation 

Did  Not 
Finish  the 
Materials 

Finished 
Materials 
but  Did  Not 
Take  Test 

Total 

Total  in  Category 

15 

36 

17 

68 

Number  Responding 

9 

14  (39%) 

9 (53%) 

32  (47%) 

Reason  Given 

Absent  from  School 

1 

6 

2 

9 

Too  Busy  With 
School  Work 

6 

12 

2 

20 

The  Study  was  a 
Waste  of  Time 

0 

3 

0 

3 

Study  was  Discon- 
tinued Too  Soon 

0 

3 

1 

4 

Othera 

2 

2 

5 

9 

Total  Responses^ 

9 

26 

10 

L: 

aOther  reasons  group  into  4 categories:  Busy  in  extracurricular  activities 

(3  students);  could  not  find  time  to  take  the  test  (2  students);  did  not 
like  the  material  (2  students);  and  misunderstood  (2  students). 

bBecause  multiple  responses  were  permitted,  total  responses  exceeds  the 
number  of  respondents. 
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In  general,  the  most  common  reason  given  is  "too  busy  with  other 
school  work."  This  is  reflective  of  what  one  student  termed  the  "end 
of  grading  period  crush"  which  conflicted  with  the  terminal  days  of  the 
study  — a conflict  which  unfortunately  arose  due  to  unforeseeable  cir- 
cumstances which  delayed  the  study.  One  girl  who  did  not  complete  the 
materials  wrote:  "I  was  very  sorry  not  to  be  able  to  complete  the  test, 
but  as  it  took  place  the  last  two  weeks  of  the  grading  period,  I was  un- 
able to  miss  the  time  from  my  various  classes."  Another  girl  wrote  that 
there  were  "too  many  notebooks  and  tests  due  to  continue  in  study."  A 
boy  checked  "Regular  school  work"  and  wrote  "I  wanted  to  complete,"  while 
a girl  checking  the  same  reason  wrote  "It  isn't  because  I didn't  like  it 
— it  interested  me  very  much."  Consequently,  it  is  assumed  that  had  the 
timing  been  different,  the  attrition  rate  would  have  been  less. 


APPENDIX  B:  PROGRAMMED  MATERIAL 


Examples  of  each  of  the  seven  versions  of  material  are  presented 
on  the  following  pages.  The  content  is  identical  for  all  versions  and 
is  from  the  first  four  pages  of  Part  I of  the  materials  used. 


VERSION  1 


reflexes 

(reflex) 


responds 

(reacts) 


response 

(reaction) 


hammer 

(mallet) 


tap  (blow) 


stimulus 


stimulus 
(tap  on 
the  knee) 


A doctor  taps  your  knee  (patellar  tendon)  with 
a rubber  mallet  to  test  your  . 


If  your  reflexes  are  normal,  your  leg  

to  the  tap  on  the  knee  with  a slight  jerk  (the 
so-called  knee  jerk). 


In  the  knee  jerk  or  patellar-tendon  reflex, 

the  kick  of  the  leg  is  the  to  the 

tap  on  the  knee. 


The  stimulating  object  used  by  the  doctor  to 
elicit  a knee  jerk  is  a . 


The  stimulus  which  elicits  a knee  jerk  is  the 

delivered  by  the  so-called  stimulus  object 

or  hammer. 


In  the  knee  jerk  reflex,  we  call  the  rubber  hammer 
the  stimulus  object  and  the  tap  or  blow  the  


An  event  is  explained  when  its  cause  is  identified. 
The  "cause"  or  explanation  of  the  knee  jerk  is 
technically  the  which  elicits  it. 


Technically  speaking,  a reflex  involves  an 
eliciting  stimulus  in  a process  called  an 
elicitation.  A stimulus  a response. 
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elicits 


response 


elicited 


latency 


threshold 


blink 


latency 


stimulus 


VERSION  1 (Con't) 


In  a reflex,  the  stimulus  and  the  elicited 
response  occur  in  a given  temporal  order; 
first  the  stimulus  then  the 


A kick  of  the  leg  is  by  a tap  on 

the  patellar-tendon. 


The  time  which  elapses  between  the  onset  of  the 
stimulus  and  the  onset  of  the  response  is  called 
the  latency.  Thus  the  time  between  the  tap  and 
the  kick  is  the  of  the  knee-jerk  reflex. 


The  weakest  stimulus  sufficient  to  elicit  a response 
marks  the  threshold  of  the  reflex.  A tap  on  the 
knee  will  not  elicit  a kick  if  it  is  below  the 


If  you  when  something  brushes  your  eye, 

the  blink  is  a response. 


The  fraction  of  a second  which  elapses  between 

"brushing  the  eye"  and  "blink"  is  the  

of  the  reflex. 


In  the  patellar-tendon  reflex,  a forceful  tap 
elicits  a strong  kick;  a tap  barely  above  the 
threshold  elicits  a weak  kick.  Magnitude  of 
response  depends  on  the  intensity  of  the  


(tap)  The  function  of  a response  corresponds  to  (is 

a function  of)  the  of  the  stimulus 

which  elicits  it. 
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intensity 

(magnitude, 

strength) 


response 


reflex 


threshold 


latency 


threshold 


greater 

(higher- 

larger) 


elicits 


VERSION  1 (Con't) 


A reflex  consists  of  both  a stimulus  and  a response, 
occurring  in  that  order.  The  term  "reflex"  is  not 
synonymous  with  the  single  term 


When  a person  is  startled  by  a loud  noise,  his 
sudden  movement  is  his  response  to  the  noise 
which  has  acted  as  a stimulus.  The  two  together 
are  called  a 


If  a sip  of  a very  weak  lemonade  does  not  cause 
salivation,  the  stimulus  is  said  to  be  below 
the 


In  the  salivary  reflex,  the  stimulus  (food) 
precedes  the  responses  (secretion  of  saliva) 
by  an  interval  of  time  called  the 


If  an  actress  uses  onion  juice  on  her  handkerchief 
to  elicit  tears  during  an  emotional  scene,  she 
must  use  enough  to  exceed  the  


The  greater  the  concentration  of  onion  juice  (stimulus) , 
the  the  magnitude  of  the  response. 


When  speaking  technically,  instead  of  saying 
onion  juice  "stimulates"  tears;  we  say  onion 
juice  tears. 


A child  touching  a hot  surface  withdraws  his 
hand  quickly.  The  word  "quickly"  suggests  that 
the  response  has  a short . 
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latency 


stimulus 


threshold 


reflex 


elicit 


magnitude 

(intensity) 


reflex 


threshold 


VERSION  1 (Con't) 


In  withdrawing  the  hand  from  the  hot  surface, 
arm  movement  is  a response  which  is  elicited 
by  a painful  to  the  hand. 


In  the  hand-withdrawal  reflex,  the  stimulus  must  be 

intense  enough  to  exceed  the  or  no  response 

will  occur. 


A light  flashed  into  the  eye  elicits  constriction 
of  the  pupil.  This  sequence  is  called  pupillary 


In  the  pupillary  reflex,  a flash  of  light  is  said 
to  response. 


In  the  pupillary  reflex,  a very  bright  flash  of 

light  elicits  a response  of  greater  

than  a weak  flash  of  light. 


A response  and  its  eliciting  stimulus  comprise 
a 


The  softest  touch  on  the  surface  of  the  eye  needed 

to  elicit  a blink,  marks  the of  the 

stimulus . 


In  any  reflex,  there  is  a stimulus  which 
the  response. 


elicits 
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VERSION  2 


A doctor  taps  your  knee  (patellar  tendon)  with 
a rubber  mallet  to  test  your  reflexes. 


If  your  reflexes  are  normal,  your  leg 

to  the  tap  on  the  knee  with  a slight  jerk  (the 
so-called  knee  jerk). 


responds  In  the  knee  jerk  or  patellar-tendon  reflex, 

(reacts)  the  kick  of  the  leg  is  the  response  to  the 

tap  on  the  knee. 


The  stimulating  object  used  by  the  doctor  to 
elicit  a knee  jerk  is  a . 


hammer  The  stimulus  which  elicits  a knee  jerk  is  the 

(mallet)  tap  delivered  by  the  so-called  stimulus  object 

or  hammer . 


In  the  knee  jerk  reflex,  we  call  the  rubber  hammer 
the  stimulus  object  and  the  tap  or  blow  the  


stimulus  An  event  is  explained  when  its  cause  is  identified. 

The  "cause'1  or  explanation  of  the  knee  jerk  is 
technically  the  stimulus  which  elicits  it. 


Technically  speaking,  a reflex  involves  an 
eliciting  stimulus  in  a process  called  an 
elicitation.  A stimulus  elicits  a 
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elicits 


elicted 


threshold 


latency 


VERSION  2 (Con't) 


In  a reflex,  the  stimulus  and  the  elicited 
response  occur  in  a given  temporal  order: 
first  the  stimulus  then  the  response. 


A kick  of  the  leg  is  by  a tap  on  the 

patellar-tendon. 


The  time  which  elapses  between  the  onset  of  the 
stimulus  and  the  onset  of  the  response  is  called 
the  latency.  Thus  the  time  between  the  tap  and 
the  kick  is  the  latency  of  the  knee-jerk  reflex. 


The  weakest  stimulus  sufficient  to  elicit  a response 
marks  the  threshold  of  the  reflex.  A tap  on  the 
knee  will  not  elicit  a kick  if  it  is  below  the  


If  you  blink  when  something  brushes  your  eye,  the 
blink  is  a response. 


The  fraction  of  a second  which  elapses  between 

"brushing  the  eye"  and  "blink"  is  the  

of  the  reflex. 


In  the  patellar-tendon  reflex,  a forceful  tap 
elicits  a strong  kick;  a tap  barely  above  the 
threshold  elicits  a weak  kick.  Magnitude  of 
response  depends  on  the  intensity  of  the  stimulus. 


The  magnitude  of  a response  corresponds  to  (is 

a function  of)  the  of  the  stimulus 

which  elicits  it. 


intensity 

(magnitude, 

strength) 


reflex 


latency 


greater 

(higher- 

larger) 
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VERSION  2 (Con't) 


A reflex  consists  of  both  a stimulus  and  a response, 
occurring  in  that  order.  The  term"reflex"  is  not 
synonymous  with  the  single  term  "response." 


When  a person  is  startled  by  a loud  noise,  his 
sudden  movement  is  his  response  to  the  noise 
which  has  acted  as  a stimulus.  The  two  together 
are  called  a 


If  a sip  of  a very  weak  lemonade  does  not  cause 
salivation,  the  stimulus  is  said  to  be  below 
the  threshold. 


In  the  salivary  reflex,  the  stimulus  (food) 
precedes  the  responses  (secretion  of  saliva) 
by  an  interval  of  time  called  the  


If  an  actress  uses  onion  juice  on  her  handkerchief 
to  elicit  tears  during  an  emotional  scene,  she 
must  use  enough  to  exceed  the  threshold. 


The  greater  the  concentration  of  onion  juice  (stimulus), 
the  the  magnitude  of  the  response. 


When  speaking  technically,  instead  of  saying 
onion  juice  "stimulates"  tears;  we  say  onion 
juice  "elicits"  tears. 


A child  touching  a hot  surface  withdraws  his 
hand  quickly.  The  word  "quickly"  suggests 
that  the  response  has  a short  . 
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latency 


threshold 


elicit 


reflex 


VERSION  2 (Con’t) 


In  withdrawing  the  hand  from  the  hot  surface, 
arm  movement  is  a response  which  is  elicited 
by  a painful  stimulus  to  the  hand. 


In  the  hand -withdrawal  reflex,  the  stimulus  must  be 

intense  enough  to  exceed  the  or  no  response 

will  occur. 


A light  flashed  into  the  eye  elicits  constriction 
of  the  pupil.  This  sequence  is  called  pupillary 
reflex. 


In  the  pupillary  reflex,  a flash  of  light  is  said 
to  response. 


In  the  pupillary  reflex,  a very  bright  flash  of 
light  elicits  a response  of  greater  magnitude 
than  a weak  flash  of  light. 


A response  and  its  eliciting  stimulus  comprise 
a 


The  softest  touch  on  the  surface  of  the  eye  needed 
to  elicit  a blink,  marks  the  threshold  of  the 
stimulus. 


In  any  reflex,  there  is  a stimulus  which 
the  response. 


elicits 
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Hammer 

(mallet) 


VERSION  3 


A doctor  taps  your  knee  (patellar-tendon)  with 
a rubber  mallet  to  test  your  reflexes. 


If  your  reflexes  are  normal,  your  leg  responds 
to  the  tap  on  the  knee  with  a slight  jerk  (the 
so-called  knee  jerk). 


In  the  knee  jerk  or  patellar-tendon  reflex, 
the  kick  of  the  leg  is  the  response  to  the 
tap  on  the  knee. 


The  stimulating  object  used  by  the  doctor  to 
elicit  a knee  jerk  is  a 


The  stimulus  which  elicits  a knee  jerk  is  the 
tap  delivered  by  the  so-called  stimulus  object 
or  hammer. 


In  the  knee  jerk  reflex,  we  call  the  rubber  hammer 
the  stimulus  object  and  the  tap  or  blow  the  stimulus. 


An  event  is  explained  when  its  cause  is  identified. 
The  "cause"  or  explanation  of  the  knee  jerk  is 
technically  the  stimulus  which  elicits  it. 


Technically  speaking,  a reflex  involves  an 
eliciting  stimulus  in  a process  called  an 
elicitation.  A stimulus  a response. 
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elicits 


threshold 


VERSION  3 (Con't) 


In  a reflex,  the  stimulus  and  the  elicited 
response  occur  in  a given  temporal  order; 
first  the  stimulus  then  the  response. 


A kick  of  the  leg  is  elicited  by  a tap  on 
the  patellar-tendon. 


The  time  which  elapses  between  the  onset  of  the 
stimulus  and  the  onset  of  the  response  is  called 
the  latency.  Thus  the  time  between  the  tap  and 
the  kick  is  the  latency  of  the  knee-jerk  reflex. 


The  weakest  stimulus  sufficient  to  elicit  a response 
marks  the  threshold  of  the  reflex.  A tap  on  the 
knee  will  not  elicit  a kick  if  it  is  below  the 


If  you  blink  when  something  brushes  your  eye, 
the  blink  is  a response. 


The  fraction  of  a second  which  elapses  between 
"brushing  the  eye"  and  "blink"  is  the  latency 
of  the  reflex. 


In  the  patellar-tendon  reflex,  a forceful  tap 
elicits  a strong  kick;  a tap  barely  above  the 
threshold  elicits  a weak  kick.  Magnitude  of 
response  depends  on  the  intensity  of  the  stimulus. 


The  magnitude  of  a response  corresponds  to  (is 

a function  of)  the  of  the  stimulus 

which  elicits  it. 


intensity 

(magnitude, 

strength) 


latency 
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VERSION  3 (Con't) 


A reflex  consists  of  both  a stimulus  and  a response 
occurring  in  that  order.  The  term  "reflex"  is  not 
synonymous  with  the  single  term  "response." 


When  a person  is  startled  by  a loud  noise,  his 
sudden  movement  is  his  response  to  the  noise 
which  has  acted  as  a stimulus.  The  two  together 
are  called  a reflex. 


If  a sip  of  a very  weak  lemonade  does  not  cause 
salivation,  the  stimulus  is  said  to  be  below 
the  threshold. 


In  the  salivary  reflex,  the  stimulus  (food) 
precedes  the  responses  (secretion  of  saliva) 
by  an  interval  of  time  called  the  . 


If  an  actress  uses  onion  juice  on  her  handkerchief 
to  elicit  tears  during  an  emotional  scene,  she 
must  use  enough  to  exceed  the  threshold. 


The  greater  the  concentration  of  onion  juice  (stimulus), 
the  greater  the  magnitude  of  the  response. 


When  speaking  technically,  instead  of  saying 
onion  juice  "stimulates"  tears;  we  say  onion 
juice  "elicits"  tears. 


A child  touching  a hot  surface  withdraws  his 
hand  quickly.  The  word  "quickly"  suggests 
that  the  response  has  a short  . 
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VERSION  3 (Con't) 


latency  In  withdrawing  the  hand  from  the  hot  surface, 

arm  movement  is  a response  which  is  elicited 
by  a painful  stimulus  to  the  hand. 


In  the  hand-withdrawal  reflex,  the  stimulus  must 
be  intense  enough  to  exceed  the  threshold  or  no 
response  will  occur. 


A light  flashed  into  the  eye  elicits  constriction 
of  the  pupil.  This  sequence  is  called  pupillary 
reflex. 


In  the  pupillary  reflex,  a flash  of  light  is  said 
to  response. 


elicit  In  the  pupillary  reflex,  a very  bright  flash  of 

light  elicits  a response  of  greater  magnitude 
than  a weak  flash  of  light. 


A response  and  its  eliciting  stimulus  comprise 
a reflex. 


The  softest  touch  on  the  surface  of  the  eye  needed 
to  elicit  a blink,  marks  the  threshold  of  the  stimulus. 


In  any  reflex,  there  is  a stimulus  which 
any  response. 


elicits 
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VERSION  4 


A doctor  taps  your  knee  (patellar-tendon)  with 
a rubber  mallet  to  test  your  reflexes. 


If  your  reflexes  are  normal,  your  leg  responds 
to  the  tap  on  the  knee  with  a slight  jerk  (the 
so-called  knee-jerk). 


In  the  knee-jerk  or  patellar-tendon  reflex, 
the  kick  of  the  leg  is  the  response  to  the 
tap  on  the  knee. 


The  stimulating  object  used  by  the  doctor  to 
elicit  a knee-jerk  is  a hammer. 


The  stimulus  which  elicits  a knee-jerk  is  the 
tap  delivered  by  the  so-called  stimulus  object 
or  hammer. 


In  the  knee-jerk  reflex,  we  call  the  rubber  hammer 
the  stimulus  object  and  the  tap  or  blow  the  stimulus. 


An  event  is  explained  when  its  cause  is  identified. 
The  "cause"  or  explanation  of  the  knee-jerk  is 
technically  the  stimulus  which  elicits  it. 


Technically  speaking,  a reflex  involves  an  eliciting 
stimulus  in  a process  called  an  elicitation.  A 
stimulus  a response. 
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VERSION  4 (Con't) 


elicits  In  a reflex,  the  stimulus  and  the  elicited 

response  occur  in  a given  temporal  order; 
first  the  stimulus  then  the  response. 


A kick  of  the  leg  is  elicited  by  a tap  on  the 
patellar-tendon . 


The  time  which  elapses  between  the  onset  of  the 
stimulus  and  the  onset  of  the  response  is  called 
the  latency.  Thus  the  time  between  the  tap  and 
the  kick  is  the  latency  of  the  knee-jerk  reflex. 


The  weakest  stimulus  sufficient  to  elicit  a response 
marks  the  threshold  of  the  reflex.  A tap  on  the 
knee  will  not  elicit  a kick  if  it  is  below  the 
threshold. 


If  you  blink  when  something  brushes  your  eye, 
the  blink  is  a response. 


The  fraction  of  a second  which  elapses  between 
"brushing  the  eye"  and  "blink"  is  the  latency  of 
the  reflex. 


In  the  patellar-tendon  reflex,  a forceful  tap 
elicits  a strong  kick;  a tap  barely  above  the 
threshold  elicits  a weak  kick.  Magnitude  of 
response  depends  on  the  intensity  of  the  stimulus. 


The  magnitude  of  a response  corresponds  to  (is 

a function  of)  the  _____  of  the  stimulus 

which  elicits  it. 
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VERSION  4 (Con't) 


intensity 

(magnitude, 

strength) 


A reflex  consists  of  both  a stimulus  and  a response, 
occurring  in  that  order.  The  term  "reflex"  is  not 
synonymous  with  the  single  term  "response." 


When  a person  is  startled  by  a loud  noise,  his 
sudden  movement  is  his  response  to  the  noise 
which  has  acted  as  a stimulus.  The  two  together 
are  called  a reflex. 


If  a sip  of  a very  weak  lemonade  does  not  cause 
salivation,  the  stimulus  is  said  to  be  below 
the  threshold. 


In  the  salivary  reflex,  the  stimulus  (food) 
precedes  the  responses  (secretion  of  saliva) 
by  an  interval  of  time  called  the  latency. 


If  an  actress  uses  onion  juice  on  her  handkerchief 
to  elicit  tears  during  an  emotional  scene,  she 
must  use  enough  to  exceed  the  threshold. 


The  greater  the  concentration  of  onion  juice  (stimulus), 
the  greater  the  magnitude  of  the  response. 


When  speaking  technically,  instead  of  saying 
onion  juice  "stimulates"  tears;  we  say  onion 
juice  "elicits"  tears. 


A child  touching  a hot  surface  withdraws  his 
hand  quickly.  The  word  "quickly"  suggests 
that  the  response  has  a short  . 


VERSION  4 (Con't) 


latency 


In  withdrawing  the  hand  from  the  hot  surface, 
arm  movement  is  a response  which  is  elicited 
by  a painful  stimulus  to  the  hand. 


In  the  hand-withdrawal  reflex,  the  stimulus  must 
be  intense  enough  to  exceed  the  threshold  or  no 
response  will  occur. 


A light  flashed  into  the  eye  elicits  a constriction 
of  the  pupil.  This  sequence  is  called  pupillary 
reflex. 


In  the  pupillary  reflex,  a flash  of  light  is  said 
to  elicit  response. 


In  the  pupillary  reflex,  a very  bright  flash  of 
light  elicits  a response  of  greater  magnitude 
than  a weak  flash  of  light. 


A response  and  its  eliciting  stimulus  comprise 
a reflex. 


The  softest  touch  on  the  surface  of  the  eye  needed 
to  elicit  a blink,  marks  the  threshold  of  the 
stimulus . 


In  any  reflex,  there  is  a stimulus  which 
the  response. 


elicits 
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VERSION  5 


A doctor  taps  your  knee  (patellar-tendon)  with 
a rubber  mallet  to  test  your  reflexes. 


If  your  reflexes  are  normal,  your  leg  responds 
to  the  tap  on  the  knee  with  a slight  jerk  (the 
so-called  knee-jerk). 


In  the  knee-jerk  or  patellar-tendon  reflex, 
the  kick  of  the  leg  is  the  response  to  the 
tap  on  the  knee. 


The  stimulating  object  used  by  the  doctor  to 
elicit  a knee-jerk  is  a hammer. 


The  stimulus  which  elicits  a knee-jerk  is  the 
tap  delivered  by  the  so-called  stimulus  object 
or  hammer. 


In  the  knee-jerk  reflex,  we  call  the  rubber  hammer 
the  stimulus  object  and  the  tap  or  blow  the  stimulus. 


An  event  is  explained  when  its  cause  is  identified. 

The  "cause"  or  explanation  of  the  knee-jerk  is  technically 
the  stimulus  which  elicits  it. 


Technically  speaking,  a reflex  involves  an 
eliciting  stimulus  in  a process  called  an 
elicitation.  A stimulus  elicits  a response. 
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VERSION  5 (Con't) 


In  a reflex,  the  stimulus  and  the  elicited 
response  occur  in  a given  temporal  order; 
first  the  stimulus  then  the  response. 


A kick  of  the  leg  is  elicited  by  a tap  on 
the  patellar-tendon. 


A time  which  elapses  between  the  onset  of  the 
stimulus  and  the  onset  of  the  response  is  called 
the  latency.  Thus  the  time  between  the  tap  and 
the  kick  is  the  latency  of  the  knee-jerk  reflex. 


The  weakest  stimulus  sufficient  to  elicit  a response 
marks  the  threshold  of  the  reflex.  A tap  on  the 
knee  will  not  elicit  a kick  if  it  is  below  the 
threshold . 


If  you  blink  when  something  brushes  your  eye, 
the  blink  is  a response. 


The  fraction  of  a second  which  elapses  between 
"brushing  the  eye"  and  "blink"  is  the  latency  of 
che  reflex. 


In  the  patellar-tendon  reflex,  a forceful  tap 
elicits  a strong  kick;  a tap  barely  above  the 
threshold  elicits  a weak  kick.  Magnitude  of 
response  depends  on  the  intensity  of  the  stimulus. 


The  magnitude  of  a response  corresponds  to  (is 

a function  of)  the  of  the  stimulus 

which  elicits  it. 
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VERSION  5 (Con't) 


intensity 

(magnitude, 

strength) 


A reflex  consists  of  both  a stimulus  and  a response, 
occurring  in  that  order.  The  term  "reflex"  is  not 
synonymous  with  the  single  term  "response." 


When  a person  is  startled  by  a loud  noise,  his 
sudden  movement  is  his  response  to  the  noise 
which  has  acted  as  a stimulus.  The  two  together 
are  called  a reflex. 


If  a sip  of  a very  weak  lemonade  does  not  cause 
salivation,  the  stimulus  is  said  to  be  below 
the  threshold. 


In  the  salivary  reflex,  the  stimulus  (food) 
precedes  the  responses  (secretion  of  saliva) 
by  an  interval  of  time  called  the  latency. 


If  an  actress  uses  onion  juice  on  her  handkerchief 
to  elicit  tears  during  an  emotional  scene,  she 
must  use  enough  to  exceed  the  threshold. 


The  greater  the  concentration  of  onion  juice  (stimulus), 
the  greater  the  magnitude  of  the  response. 


When  speaking  technically,  instead  of  saying 
onion' juice  "stimulates"  tears,  we  say  onion 
juice  "elicits"  tears. 


A child  touching  a hot  surface  withdraws  his 
hand  quickly.  The  word  "quickly"  suggests 
that  the  response  has  a short  latency. 
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VERSION  5 (Con't) 


In  withdrawing  the  hand  from  the  hot  surface, 
am  movement  is  a response  which  is  elicited 
by  a painful  stimulus  to  the  hand. 


In  the  hand-withdrawal  reflex,  the  stimulus  must  be 
intense  enough  to  exceed  the  threshold  or  no  response 
will  occur. 


A light  flashed  into  the  eye  elicits  constriction 
of  the  pupil.  This  sequence  is  called  pupillary 
reflex. 


In  the  pupillary  reflex,  a flash  of  light  is  said 
to  elicit  response. 


In  the  pupillary  reflex,  a very  bright  flash  of 
light  elicits  a response  of  greater  magnitude 
than  a weak  flash  of  light. 


A response  and  its  eliciting  stimulus  comprise 
a reflex. 


The  softest  touch  on  the  surface  of  the  eye  needed 
to  elicit  a blink,  marks  the  threshold  of  the 
stimulus . 


In  any  reflex,  there  is  a stimulus  which 
the  response. 


elicits 
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VERSION  6 


A doctor  taps  your  knee  (patellar-tendon)  with 
a rubber  mallet  to  test  your  reflexes. 


If  your  reflexes  are  normal,  your  leg  responds 
to  the  tap  on  the  knee  with  a slight  jerk  (the 
so-called  knee-jerk). 


In  the  knee  jerk  or  patellar-tendon  reflex, 
the  kick  of  the  leg  is  the  response  to  the 
tap  on  the  knee. 


The  stimulating  object  used  by  the  doctor  to 
elicit  a knee-jerk  is  a hammer. 


The  stimulus  which  elicits  a knee-jerk  is  the 
tap  delivered  by  the  so-called  stimulus  object 
or  hammer. 


In  the  knee-jerk  reflex,  we  call  the  rubber  hammer 
the  stimulus  object  and  the  tap  or  blow  the  stimulus. 


An  event  is  explained  when  its  cause  is  identified. 
The  "cause"  or  explanation  of  the  knee-jerk  is 
technically  the  stimulus  which  elicits  it. 


Technically  speaking,  a reflex  involves  an 
eliciting  stimulus  in  a process  called  an 
elicitation.  A stimulus  elicits  a response. 
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VERSION  6 (Con't) 


In  a reflex,  the  stimulus  and  the  elicited 
response  occur  in  a given  temporal  order; 
first  the  stimulus  then  the  response. 


A kick  of  the  leg  is  elicited  by  a tap  on 
the  patellar-tendon. 


The  time  which  elapses  between  the  onset  of  the 
stimulus  and  the  onset  of  the  response  is  called 
the  latency.  Thus  the  time  between  the  tap  and 
the  kick  is  the  latency  of  the  knee-jerk  reflex. 


The  weakest  stimulus  sufficient  to  elicit  a response 
marks  the  threshold  of  the  reflex.  A tap  on  the 
knee  will  not  elicit  a kick  if  it  is  below  the 
threshold. 


If  you  blink  when  something  brushes  your  eye , 
the  blink  is  a response. 


The  fraction  of  a second  which  elapses  between 
"brushing  the  eye"  and  "blink"  is  the  latency 
of  the  reflex. 


In  the  patellar-tendon  reflex,  a forceful  tap 
elicits  a strong  kick;  a tap  barely  above  the 
threshold  elicits  a weak  kick.  Magnitude  of 
response  depends  on  the  intensity  of  the  stimulus. 


The  magnitude  of  a response  corresponds  to  (is 
a function  of)  the  intensity  of  the  stimulus 
which  elicits  it. 
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VERSION  6 (Con't) 


A reflex  consists  of  both  a stimulus  and  a response 
occurring  in  that  order.  The  term  "reflex"  is  not 
synonymous  with  the  single  term  "response." 


When  a person  is  startled  by  a loud  noise,  his 
sudden  movement  is  his  response  to  the  noise 
which  has  acted  as  a stimulus.  The  two  together 
are  called  a reflex. 


If  a sip  of  a very  weak  lemonade  does  not  cause 
salivation,  the  stimulus  is  said  to  be  below 
the  threshold. 


In  the  salivary  reflex,  the  stimulus  (food) 
precedes  the  responses  (secretion  of  saliva) 
by  an  interval  of  time  called  the  latency. 


If  an  actress  uses  onion  juice  on  her  handkerchief 
to  elicit  tears  during  an  emotional  scene,  she 
must  use  enough  to  exceed  the  threshold. 


The  greater  the  concentration  of  onion  juice  (stimulus) , 
the  greater  the  magnitude  of  the  response. 


When  speaking  technically,  instead  of  saying 
onion  juice  "stimulates"  tears;  we  say  onion 
juice  "elicits"  tears. 


A child  touching  a hot  surface  withdraws  his 
hand  quickly.  The  word  "quickly"  suggests 
that  the  response  has  a short  latency. 
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VERSION  6 (Con't) 


In  withdrawing  the  hand  from  the  hot  surface, 
arm  movement  is  a response  which  is  elicited 
by  a painful  stimulus  to  the  hand. 


In  the  hand-withdrawal  reflex,  the  stimulus  must  be 
intense  enough  to  exceed  the  threshold  or  no  response 
will  occur. 


A light  flashed  into  the  eye  elicits  a constriction 
of  the  pupil.  This  sequence  is  called  pupillary 
reflex. 


In  the  pupillary  reflex,  a flash  of  light  is  said 
to  elicit  a response. 


In  the  pupillary  reflex,  a very  bright  flash  of 
light  elicits  a response  of  greater  magnitude 
than  a weak  flash  of  light. 


A response  and  its  eliciting  stimulus  comprise 
a reflex. 


The  softest  touch  on  the  surface  of  the  eye  needed 
to  elicit  a blink,  marks  the  threshold  of  the 
stimulus . 


In  any  reflex,  there  is  a stimulus  which 
the  response. 
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VERSION  7 


A doctor  taps  your  knee  (patellar-tendon)  with 
a rubber  mallet  to  test  your  reflexes. 


If  your  reflexes  are  normal,  your  leg  responds 
to  the  tap  on  the  knee  with  a slight  jerk  (the 
so-called  knee  jerk). 


In  the  knee  jerk  or  patellar-tendon  reflex, 
the  kick  of  the  leg  is  the  response  to  the 
tap  on  the  knee. 


The  stimulating  object  used  by  the  doctor  to 
elicit  a knee  jerk  is  a hammer. 


The  stimulus  which  elicits  a knee  jerk  is  the 
tap  delivered  by  the  so-called  stimulus  object 
or  hammer. 


In  the  knee  jerk  reflex,  we  call  the  rubber  hammer 
the  stimulus  object  and  the  tap  or  blow  the  stimulus. 


An  event  is  explained  when  its  cause  is  identified. 
The  "cause"  or  explanation  of  the  knee  jerk  is 
technically  the  stimulus  which  elicits  it. 


Technically  speaking,  a reflex  involves  an 
eliciting  stimulus  in  a process  called  an 
elicitation.  A stimulus  elicits  a response. 
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VERSION  7 (Con't) 


In  a reflex,  the  stimulus  and  the  elicited 
response  occur  in  a given  temporal  order; 
first  the  stimulus  then  the  response. 


A kick  of  the  leg  is  elicited  by  a tap  on 
the  patellar- tendon. 


The  time  which  elapses  between  the  onset  of  the 
stimulus  and  the  onset  of  the  response  is  called 
the  latency.  Thus  the  time  between  the  tap  and 
the  kick  is  the  latency  of  the  knee-jerk  reflex. 


The  weakest  stimulus  sufficient  to  elicit  a response 
marks  the  threshold  of  the  reflex.  A tap  on  the 
knee  will  not  elicit  a kick  if  it  is  below  the 
threshold. 


If  you  blink  when  something  brushes  your  eye, 
the  blink  is  a response. 


The  fraction  of  a second  which  elapses  between 
"brushing  the  eye"  and  "blink"  is  the  latency 
of  the  reflex. 


In  the  patellar-tendon  reflex,  a forceful  cap 
elicits  a strong  kick;  a tap  barely  above  the 
threshold  elicits  a weak  kick.  Magnitude  of 
response  depends  on  the  intensity  of  the  stimulus. 


The  magnitude  of  a response  corresponds  to  (is 
a function  of)  the  intensity  of  the  stimulus 
which  elicits  it. 


-109- 


VERSION  7 (Con't) 


A reflex  consists  of  both  a stimulus  and  a response 
occurring  in  that  order.  The  term  "reflex"  is  not 
synonymous  with  the  single  term  "response." 


When  a person  is  startled  by  a loud  noise,  his 
sudden  movement  is  his  response  to  the  noise 
which  has  acted  as  a stimulus.  The  two  together 
are  called  a reflex. 


If  a sip  of  a very  weak  lemonade  does  not  cause 
salivation,  the  stimulus  is  said  to  be  below 
the  threshold. 


In  the  salivary  reflex,  the  stimulus  (food) 
precedes  the  responses  (secretion  of  saliva) 
by  an  interval  of  time  called  the  latency. 


If  an  actress  uses  onion  juice  on  her  handkerchief 
to  elicit  tears  during  an  emotional  scene,  she 
must  use  enough  to  exceed  the  threshold. 


The  greater  the  concentration  of  onion  juice  (stimulus), 
the  greater  the  magnitude  of  the  response. 


When  speaking  technically,  instead  of  saying 
onion  juice  "stimulates"  tears;  we  say  onion 
juice  "elicits"  tears. 


A child  touching  a hot  surface  withdraws  his 
hand  quickly.  The  word  "quickly"  suggests 
that  the  response  has  a short  latency. 
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VERSION  7 (Con't) 


In  withdrawing  the  hand  from  the  hot  surface, 
arm  movement  is  a response  which  is  elicited 
by  a painful  stimulus  to  the  hand. 


In  the  hand-withdrawal  reflex,  the  stimulus  must  be 
intense  enough  to  exceed  the  threshold  or  no  response 
will  occur. 


A light  flashed  into  the  eye  elicits  a constriction 
of  the  pupil.  This  sequence  is  called  pupillary 
reflex. 


In  the  pupillary  reflex,  a flash  of  light  is  said 
to  elicit  response. 


In  the  pupillary  reflex,  a very  bright  flash  of 
light  elicits  a response  of  greater  magnitude 
than  a weak  flash  of  light. 


A response  and  its  eliciting  stimulus  comprise 
a reflex. 


The  softest  touch  on  the  surface  of  the  eye  needed 
to  elicit  a blink,  marks  the  threshold  of  the 
stimulus . 


In  any  reflex,  there  is  a stimulus  which  elicits 
the  response. 


APPENDIX  C : INSTRUMENTS 
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PSYCHOLOGICAL  ACHIEVEMENT  TEST 
FORM  A 


- Do  not  write  on  this  test  booklet  - 


- Write  only  on  the  answer  cards  provided  - 


- Use  only  the  special  pencil  - 


INSTRUCTIONS : 


1.  This  is  a multiple  choice  test  in  psychology.  There  is  no 
time  limit  for  this  test.  Most  students  will  finish  in  45 
to  60  minutes. 


2.  Complete  both  sides  of  the  yellow  STUDENT  DATA  CARD  following 
the  instructions  on  the  card. 


3.  Write  your  name  and  grade  as  indicated  on  the  front  of  the 
white  TEST  ANSWER  CARD. 


4.  Fill  in  the  bubble  labeled  "Part  I"  and  answer  questions  1-50 
on  the  front  of  the  TEST  ANSWER  CARD.  Fill  in  the  bubble 
labeled  "Part  II"  and  answer  questions  51-64  on  the  back  of 
the  TEST  ANSWER  CARD. 


5.  If  you  have  any  questions  about  how  to  take  this  examination, 
ask  the  monitor. 


- DO  NOT  WRITE  ON  THIS  TEST  BOOKLET  - 


Prepared  by  J.  M.  Flynn,  Nova  University,  1968 
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1.  During  an  air  attack  the  odor  of  a bomb  shelter  may  be  paired  with 
the  stimuli  which  elicits  fear  during  the  attack.  If  so,  this  will  be 
an  example  of 

1)  arranged  operant  conditioning. 

2)  arranged  respondent  conditioning. 

3)  accidental  operant  conditioning. 

4)  accidental  respondent  conditioning. 

2.  Head  scratching,  pencil  chewing  and  table  tapping  by  a student  while 
he  is  solving  a problem  are  most  likely  to  be  a form  of 

1)  unconditioned  problem  solving  behavior, 

2)  superstitious  problem  solving  behavior. 

3)  generalized  problem  solving  behavior. 

4)  discriminative  problem  solving  behavior. 

3.  A thirsty  man  who  sees  a drinking  fountain  will  probably  go  to  the 
fountain  because 

1)  walking  to  the  fountain  has  been  reinforced  in  the  past  when 
he  was  thirsty. 

2)  walking  to  the  fountain  has  been  extinguished  in  the  past  when 
he  was  thirsty. 

3)  walking  to  the  fountain  is  an  unconditioned  response. 

4)  the  fountain  is  an  unconditioned  stimulus. 

4.  A green  key  is  an  SA  for  pecking  by  a pigeon,  and  a red  key  is  an 

in  a fixed  ratio  schedule  of  25  responses.  After  the  key  turns  green, 

1)  no  responses  are  reinforced. 

2)  all  responses  are  reinforced. 

3)  the  twenty-fifth  response  is  reinforced. 

4)  The  twenty-sixth  response  is  reinforced. 

5.  In  a typical  operant  conditioning  experiment,  the  experimenter  arranges 
a mechanical  or  electrical  device  to 

1)  reinforce  any  response. 

2)  reinforce  a specific  response. 

3)  Reinforce  occasional  responses  regardless  of  their  nature. 

4)  reinforce  any  frequently  occurring  response. 

6.  In  the  usual  operant  experiment,  bending  down  to  the  magazine  makes  the 
food  visible  and  the  pigeon  seizes  the  food  with  its  beak.  Which  of  the 
following  statements  most  correctly  describes  the  situation? 

1)  With  respect  to  seizing  food,  the  sight  of  food  is  a SA. 

2)  With  respect  to  bending  down,  the  sight  of  food  is  a reinforcer. 

3)  With  respect  to  seizing  food,  the  sigh  of  food  is  a SD. 

4)  Statements  2.  & 3.  above  are  correct,  but  not  statement  1. 
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7.  The  highest  rate  of  responding  is  generated  by  which  of  the  following 
schedules  of  reinforcement 

1)  variable-ratio 

2)  fixed-interval 

3)  variable-interval 

4)  fixed-ratio 

8.  A chimpanzee  drops  coins  into  a vending  machine  which  delivers 
peanuts.  The  peanuts  are 

1)  conditioned  reinforcers 

2)  secondary  reinforcers 

3)  unconditioned  reinforcers 

4)  accidental  reinforcers 

9.  If  a drill  instructor  says  "right  face!"  and  all  of  his  men  turn  to  the 
right,  it  can  be  said  their  behavior 

1)  is  under  the  control  of  a verbal  stimulus. 

2)  is  an  unconditioned  response. 

3)  is  reinforced  on  a fixed-ratio  schedule. 

4)  demonstrates  complete  stimulus  generalization. 

10.  A particular  schedule  of  reinforcement  always  has  325  responses  between 
reinforcements.  This  schedule  is  called 

1)  variable-interval  schedule 

2)  variable-ratio  schedule 

3)  fixed-interval  schedule 

4)  fixed-ratio  schedule 

11.  In  teaching  a child  to  copy  a picture,  if  we  reinforce  only  a perfect 
drawing,  we  can  most  likely  expect  which  one  of  the  following  results? 

1)  His  drawing  behavior  will  be  extinguished. 

2)  His  drawing  behavior  will  rapidly  improve. 

3)  His  drawing  behavior  will  be  slowly  improved. 

4)  While  his  drawing  behavior  will  not  be  extinguished,  but  it  will 
not  improve. 

12.  Smooth  muscle  fibers  are  found  in  which  one  of  the  following: 

1)  the  arms 

2)  the  stomach 

3)  the  neck 

4)  the  ankle 
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13.  In  the  standard  experimental  box  the  click  of  the  food  magazine  is 
an  effective  reinforcer  for  pecking  because 

1)  the  click  occurs  instantly  but  there  is  a short  delay  be- 
fore reinforcement  with  food. 

2)  food  occurs  instantly  but  there  is  a short  delay  before 
reinforcement  with  the  click. 

3)  reinforcement  by  the  click  and  by  the  food  both  occur  in- 
stantly. 

4)  reinforcement  with  the  click  and  the  food  are  both  delayed. 

14.  The  technique  of  reinforcing  a response  in  the  presence  of  one  sti- 
mulus and  not  reinforcing  it  in  the  presence  of  another  stimulus  is 
used  to  establish 

1)  an  adaption 

2)  a superstition 

3)  a discrimination 

4)  a generalized  reinforcer 

15.  The  term  which  best  describes  the  situation  in  which  some  responses 
are  reinforced  while  some  rather  similar  responses  are  not  reinforced 
is 

1)  continuous  reinforcement 

2)  differential  reinforcement 

3)  accidental  reinforcement 

4)  generalized  reinforcement 

16.  A child  has  a temper  tantrum  and  screams  for  candy.  The  mother  gives 
the  candy  to  the  child  and  the  tantrum  ceases.  Which  one  of  the  fol- 
lowing statements  best  describes  this  situation? 

1)  Termination  of  the  tantrum  negatively  reinforces  the  mother's 
response  of  handing  candy  to  the  child. 

2)  The  termination  of  the  tantrum  positively  reinforces  the 
mother's  response  of  handing  candy  to  the  child. 

3)  Handing  candy  to  the  child  negatively  reinforces  the  tantrum. 

4)  Neither  the  mother's  response  of  handing  candy  to  the  child 
nor  the  child's  tantrum  are  reinforced  at  all. 

.17.  If  only  red  apples  are  always  sweet  and  edible  and  green  apples  are 
never  sweet  and  edible,  which  of  the  following  statements  is  correct 
for  the  response  of  picking  and  eating  an  apple? 

1)  The  "redness"  or  "greenness"  of  an  apple  is  an  SD. 

2)  The  "redness"  or  "greenness"  of  an  apple  is  an  S^. 

3)  The  "redness"  of  an  apple  is  an  SA  and  the  "greenness"  of 

an  apple  is  an  S®. 

The  "redness"  of  an  apple  is  an  SD  and  the  "greenness"  of 
an  apple  is  an  S^. 


4) 
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18.  Vomiting  due  to  chemical  irritants  in  the  stomach  is: 

1)  a conditioned  response. 

2)  an  unconditioned  response. 

3)  a conditioned  stimulus. 

4)  an  unconditioned  stimulus. 

19.  In  a chain  of  behavior,  the  presence  of  food 

1)  can  be  a discriminative  stimulus,  but  not  a reinforcing 
stimulus . 

2)  can  be  a reinforcing  stimulus,  but  not  a discriminative 
stimulus . 

3)  can  be  both  a discriminative  and  reinforcing  stimulus. 

4)  can  be  neither  a discriminative  nor  reinforcing  stimulus. 

20.  Which  of  the  following  statements  is  true  of  continuous  reinforce- 
ment: 

1)  Every  response  is  reinforced  and  the  time  required  for  ex- 
tinction is  long. 

2)  Many,  but  not  all,  of  the  responses  are  reinforced  and  the 
time  required  for  extinction  is  short. 

3)  Every  response  is  reinforced  and  the  time  required  for  ex- 
tinction is  short. 

4)  Many,  but  not  all,  of  the  responses  are  reinforced  and  the 
time  required  for  extinction  is  long. 

21.  In  a typical  operant  conditioning  experiment,  a pigeon  might  be  placed 
in  a box  which  has  a key  for  pecking  and  a food  magazine.  To  condi- 
tion operant  behavior  the  apparatus 

1)  pairs  a conditioned  stimulus  with  the  presentation  of  food 
in  the  magazine. 

2)  reinforces  any  response  which  occurs  by  presenting  the  animal 
with  food. 

3)  elicits  specific  responses  by  presentation  of  food. 

4)  reinforces  specific  responses  when  they  are  emitted. 

22.  Even  though  reinforcement  can  only  occur  after  a response,  the  shortest 
interval  in  a variable- interval  schedule 

1)  may  not  be  any  smaller  than  the  average  interval. 

2)  may  be  so  small  that  one  reinforcement  occurs  immediately 
after  another  reinforcement. 

3)  must  be  at  least  one  minute  in  length. 

4)  is  the  same  length  as  the  longest  interval  because  all  of  the 
intervals  are  the  same  length. 

23.  If  a pigeon's  key  pecking  response  is  not  followed  by  reinforcement, 
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key  pecking  responses  will  occur 

1)  less  frequently  in  the  future  than  if  the  response  had 
been  followed  by  reinforcement. 

2)  more  frequently  in  the  future  than  if  the  response  had 
been  followed  by  reinforcement. 

3)  as  frequently  in  the  future  as  if  the  response  had  been 
followed  by  reinforcement. 

4)  no  statement  can  be  made  about  future  responses  since  the 
frequency  depends  upon  the  particular  animal. 

Read  the  following  before  answering  questions  24  and  25: 

In  a standard  respondent  conditioning  experiment  a tone  was  paired  with 
a meat  powder  and  the  saliva  flow  of  the  dog  was  measured.  There  were 
60  trials.  On  trials  1,  12,  24,  36,  48  and  60,  the  tone  was  presented 
alone  (i.e.,  meat  powder  was  not  used).  The  table  below  shows  the 
amount  of  saliva  flow  and  the  time  between  onset  of  tone  and  salivation 
on  each  of  these  test  trials. 


Trial  No. 


Drops  of  Saliva 


Time  Between  Onset  of 
Tone  and  Salivation 
(in  seconds) 


10 

12  8 15 

24  27  7 

36  68  3 

48  72  2 

60  71  2 


24.  On  Trial  12,  the  tone  alone  was  presented  and  8 drops  of  saliva 
flowed.  This  is  evidence  that  the  tone  was  already: 


1)  a conditioned  response 

2)  a conditioned  stimulus 

3)  an  unconditioned  response 

4)  an  unconditioned  stimulus 


25.  From  this  experiment,  we  can  conclude  that  as  the  number  of  trials 
in  which  the  conditioned  and  unconditioned  stimuli  are  paired 
increases : 


1)  the  latency  of  the  conditioned  reflex  increases  and  the 
magnitude  of  the  conditioned  response  increases  until  both 
reach  a limit. 

2)  the  latency  of  the  conditioned  reflex  decreases  and  the 
magnitude  of  the  conditioned  response  decreases  until  both 
reach  a limit. 

3)  the  latency  of  the  conditioned  reflex  decreases  and  the 
magnitude  of  the  conditioned  response  increases  until  both 
reach  a limit. 
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4)  The  latency  of  the  conditioned  reflex  increases  and 
the  magnitude  of  the  conditioned  response  decreases 
until  both  reach  a limit. 

Read  the  following  figure  and  use  it  for  questions  26  and  27. 
RECORD  OF  A PIGEON  ON  A FIXED  INTERVAL  SCHEDULE 


26.  At  which  of  the  following  points  in  the  above  figure  were  the 
responses  reinforced? 

1)  A,  B,  and  C 

2)  A,  B,  and  D 

3)  B,  D,  and  E 

4 ) A , B , and  E 

27 . At  which  of  the  following  intervals  in  the  above  figure  is  the 

rate  the  lowest? 

1)  between  C and  D 

2)  between  D and  E 

3)  between  B and  C 

4)  between  A and  B 

28.  If  a cigarette  lighter  fails  to  light  and  the  person  stops  trying 
to  light  it  after  a very  few  attempts  , we  can  most  probably  assume 
that  his  previous  lighting  responses  were  reinforced  on  a 

1)  continuous  schedule 

2)  variable-interval  schedule 

3)  variable-ratio  schedule 

4)  fixed-ratio  schedule 

29.  If  in  teaching  a student  to  throw  a javelin  the  track  coach  is 
satisfied  with  every  throw  no  matter  how  bad  it  may  be,  he  is 
using  which  of  the  following  techniques. 
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1)  he  is  using  successive  approximation,  but  he  is  not 
using  differential  reinforcement. 

2)  he  is  not  using  successive  approximation,  but  he  is 
using  differential  reinforcement. 

3)  he  is  using  successive  approximation  and  he  is  using 
differential  reinforcement. 

4)  he  is  not  using  successive  approximation  and  he  is  not 
using  differential  reinforcement. 

‘30.  Which  one  of  the  following  statements  about  reinforcement  is 
correct? 

1)  Positive  reinforcement  increases  the  rate  of  response 
while  negative  reinforcement  decreases  the  rate  of  response. 

2)  Both  positive  and  negative  reinforcement  increase  the 
rate  of  response. 

3)  Positive  reinforcement  decreases  the  rate  of  response  and 

negative  reinforcement  increases  the  rate  of  responses. 

4)  Both  positive  and  negative  reinforcement  decrease  the 
rate  of  response. 

31.  When  reinforcements  are  never  close  together  in  time,  which  of  the 
following  situations  would  most  likely  occur  immediately  after 
reinforcement? 

1)  Rapid  responding 

2)  A few  responses  followed  by  a long  pause. 

3)  A single  response  followed  by  a long  pause. 

4)  A pause  in  responding. 

32.  In  shaping  animal  behavior  such  as  that  of  a dog  the  first  step  is 

often  to  establish  a sound  such  as  a click  as  a reinforcer  by 

repeatedly  pairing  it  with  food.  One  reason  for  using  an  auditory 
stimulus  instead  of  food  in  training  the  dog  is 

1)  the  auditory  stimulus  is  a primary  reinforcer  whereas 
the  food  is  not. 

2)  the  auditory  stimulus  affects  the  dog  immediately  whereas 
the  food  may  not. 

3)  the  auditory  stimulus  is  a generalized  reinforcer  whereas 
the  food  is  not. 

4)  the  click  can  be  used  for  continuous  reinforcement  whereas 
the  food  can  only  be  used  on  a ratio  schedule. 

33.  If  we  consistently  get  no  answer  when  we  dial  a number  on  the 
telephone  we  stop  dialing.  This  process  is  called 
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1)  negative  reinforcement 

2)  positive  reinforcement 

3)  extinction 

4)  adaption 

34.  When  you  telephone  a- friend  not  at  home,  your  reinforcement 
by  getting  an  answer  is  on 

1)  a ratio-schedule. 

2)  an  interval-schedule. 

3)  a continuous  reinforcement  schedule. 

4)  no  particular  reinforcement  schedule. 

35.  Which  of  the  following  methods  will  most  likely  be  used  to 
shape  a complex  skill? 

1)  generalized  reinforcement  of  unconditioned  responses 

2)  pairing  of  conditioned  stimuli  with  unconditioned  stimuli 

3)  differential  reinforcement  of  successive  approximations  to 
the  final  behavior 

4)  continuous  reinforcement  of  generalized  behavior  through 
the  use  of  secondary  reinforcers 

36.  The  adage  "practice  makes  perfect"  is  probably  true  only  if  some 
product  of  one's  behavior  provides 

1)  generalized  reinforcement. 

2)  primary  reinforcement. 

3)  differential  reinforcement. 

4)  accidental  reinforcement. 

37 . A dog  has  been  conditioned  to  salivate  at  the  sound  of  a tuning 
fork.  Later,  the  experimenter  wants  to  extinguish  the  conditioned 
response.  Which  of  the  following  methods  would  lead  to  extinction: 

1)  Repeated  presentation  of  the  tuning  fork  with  food  powder. 

2)  Repeated  presentation  of  food  powder  without  the  tuning  fork. 

3)  Repeated  presentation  of  the  tuning  fork  without  meat  powder. 

4)  Repeated  presentation  of  meat  powder  occasionally  paired  with 
the  tuning  fork. 

38.  Which  of  the  following  is  true  for  a multiple  schedule? 

1)  Behavior  in  a multiple  schedule  can  only  be  reinforced  by 
primary  reinforcers. 

2)  A multiple  schedule  can  consist  only  of  a fixed  interval 
schedule  alternating  with  a fixed  ratio  schedule. 

3)  In  a multiple  schedule,  behavior  comes  under  stimulus  control. 

4)  Reinforcement  in  a multiple  schedule  can  only  be  by  conditioned 
reinforcers . 
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39.  A response  is  said  to  have  become  more  probable  if  under 
controlled  conditions 

1)  it  is  observed  to  occur  more  frequently  than  it  did 
at  some  previous  time. 

2)  it  is  observed  to  occur  less  frequently  than  at  some  previous 
time . 

3)  its  latency  is  observed  to  increase. 

4)  its  magnitude  is  observed  to  increase. 

40.  One  way  in  which  to  train  a dog  not  to  sit  still  is  by 
withholding  reinforcement  until  the  dog  moves.  The  term  which 
best  describes  this  technique  is  called 

1)  accidental  reinforcement 

2)  primary  reinforcement 

3)  continuous  reinforcement 

4)  differential  reinforcement 

41.  If  a pigeon  is  placed  in  an  experimental  box  for-  the  first  time, 
which  of  the  following  pecking  response  patterns  is  most  likely 
to  occur  before  the  pigeon  becomes  accustomed  to  the  situation? 

1)  Relatively  long  pauses  between  responses. 

2)  extremely  short  pauses  between  responses. 

3)  alternating  long  and  short  pauses  between  responses. 

4)  none  of  the  above. 

42.  If  the  time  between  pecking  responses  for  a pigeon  placed  in  an 
experimental  box  for  the  first  time  and  under  a schedule  of 
continuous  reinforcement  were  plotted,  which  of  the  following 
figures  would  most  nearly  resemble  the  resultant  graph? 


1) 


3) 


2)  T I 

■i  ! 
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Refer  to  the  following  figures  for  questions  43  and  44. 


The  figures  shown  are  the  cumulative  records  of  conditioned 
key  pecking  by  two  pigeons.  All  responses  were  reinforced. 
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Time 


43.  In  Figure  A,  Pigeon  A reached  his  maximal  rate  of  responding 
after 

1)  1 reinforcement 

2)  4 reinforcements 

3)  8 reinforcements 

4)  11  reinforcements 

44.  The  record  for  Pigeon  B differs  from  that  for  Pigeon  A mainly 
in  the  fact  that 

1)  the  maximal  rate  of  responding  was  significantly 
faster. 

2)  the  maximal  rate  of  responding  was  significantly  slower. 

3)  a longer  time  elapsed  between  the  first,  second  and 
third  responses. 

4)  the  responses  for  Pigeon  B occur  more  frequently. 

45.  Which  of  the  following  are  a direct  result  of  actions  of  the 
glands: 

1)  vomiting 

2)  sweating 

3)  blushing 

4)  breathing 

46.  A bowler  may  snap  his  fingers  just  as  the  ball  hits  the  pins. 

If  he  gets  a "strike" 

1)  his  finger  snapping  will  not  be  reinforced  because 
this  behavior  has  no  effect  on  the  ball. 

2)  his  finger  snapping  will  be  reinforced  even  though 
this  behavior  has  no  effect  on  the  ball. 

3)  his  finger  snapping  will  not  be  reinforced  although 
this  behavior  does  have  an  effect  on  the  ball. 

4)  his  finger  snapping  will  be  reinforced  because  this 
behavior  does  have  an  effect  on  the  ball. 
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47.  "New"  forms  of  behavior  can  be  generated  by  differentially 

■ reinforcing  forms  which  gradually  approximate  the  final  form. 

We  call  this  procedure 

1)  shaping  the  behavior 

2)  extinguishing  the  behavior 

3)  adapting  the  behavior 

4)  emitting  the  behavior 

48.  When  a food  deprived  pigeon  is  placed  for  the  very  first  time  in 
a standard  experimental  box  "emotional"  behavior  is  elicited  by 
the  box.  This  behavior  is 

1)  conditioned  and  can  be  eliminated  through  extinction. 

2)  conditioned  and  can  be  eliminated  through  adaption. 

3)  unconditioned  and  can  be  eliminated  through  extinction. 

4)  unconditioned  and  can  be  eliminated  through  adaption. 

49.  Sometimes  the  click  made  by  the  empty  food  magazine  in  an  operant 
conditioning  experiment  acquires  the  power  to  reinforce  the  key 
pecking  response  of  the  pigeon.  In  this  case  we  will  call  the 
sound  of  the  magazine 

1)  an  unconditioned  reinforcer 

2)  a primary  reinforcer 

3)  a conditioned  reinforcer 

4)  a generalized  reinforcer 

50.  Stimuli  which  were  never  present  at  reinforcement 

1)  will  never  acquire  control  over  the  response. 

2)  may  acquire  control  over  the  response  through  accidental 
reinforcement . 

3)  frequently  acquire  control  over  the  response  although 
practically  nothing  is  known  about  the  conditions  under 
which  they  acquire  this  control. 

4)  may  acquire  control  over  the  response  if  a similar  stimulus 
has  been  present  during  reinforcement. 

51.  One  way  in  which  to  extinguish  behavior  you  do  not  want  another 
person  to  emit  is  by 

1)  showing  them  affection  when  they  emit  the  behavior 

2)  withholding  approval  when  they  do  not  emit  the  behavior 

3)  using  a generalized  reinforcer  when  they  emit  the  behavior 

4)  withholding  approval  when  they  emit  the  behavior 
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52.  A worker  in  an  electronics  factory  is  paid  $3  for  every 
lot  of  15  tuning  knobs  which  his  machine  produces.  Since 
he  is  reinforced  only  when  he  finishes  a lot  the  reinforce- 
ment is  on 

1)  a variable-ratio  schedule 

2)  a fixed  -interval  schedule 

3)  a continuous-reinforcement  schedule 

4)  a fixed-ratio  schedule 

53.  When  reinforcement  is  discontinued  after  a response  has  been 
maintained  on  a variable  interval  schedule,  responses 

1)  are  emitted  at  a gradually  declining  rate  for  a long 
period  of  time. 

2)  are  emitted  at  a rapidly  declining  rate  for  only  a short 
period  of  time. 

3)  are  emitted  at  a constantly  high  rate  for  an  indefinite 
period  of  time. 

4)  will  cease  to  be  emitted  almost  immediately. 

54.  Which  of  the  following  techniques  will  effectively  prevent  unwanted 
conditioned  behavior? 

1)  Condit  ion  some  incompatible  behavior. 

2)  Withhold  reinforcement  occasionally  when  the  behavior 
is  emitted. 

3)  Withhold  reinforcement  every  time  the  behavior  is  emitted. 

4)  Both  1 and  3 will  effectively  prevent  the  behavior  but  not  2. 

55.  Frequently  words  like  "bad"  and  "wrong"  elicit  emotional  responses 
in  a person.  These  responses  are: 

1)  unconditioned  responses. 

2)  conditioned  responses. 

3)  sometimes  unconditioned  responses  and  other  times  conditioned 
responses . 

4)  neither  unconditioned  nor  conditioned  responses. 

56.  A reinforcement  schedule  which  provides  reinforcements  after  a set 
number  of  responses  is  called 

1)  a variable-ratio  schedule 

2)  a fixed-ratio  schedule 

3)  a variable-interval  schedule 

4)  a fixed-interval  schedule 
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Refer  to  the  following  figure  for  questions  57  and  58.  The  figure 
is  the  cumulative  record  of  keypecking  responses  by  a pigeon. 


57.  Between  which  two  positions  in  the  above  figure  did  the  animal 
pause  without  responding  for  about  five  minutes? 

1)  a and  b 

2)  b and  c 

3)  c and  d 

4)  e and  f 

58.  In  the  above  figure  three  responses  were  made  fairly  close 
together.  Which  of  the  following  letters  identifies  these  three 
responses? 


1) 

a, 

b, 

and 

c 

2) 

c, 

d, 

and 

e 

3) 

b, 

c, 

and 

d 

4) 

d. 

e, 

and 

f 

59.  If  a conditioned  reinforcer  is  paired  with  several  different 
uncinditioned  reinforcers,  it  becomes 

1)  an  accidental  reinforcer 

2)  an  unconditioned  reinforcer 

3)  a generalized  reinforcer 

4)  a primary  reinforcer 

60.  Which  of  the  following  statements  is  the  best  psychological 
interpretation  of  the  question  "what  does  he  see  in  her?" 

1)  "How  does  he  reinforce  her  courting  behavior?" 

2)  "How  does  she  reinforce  his  courting  behavior?" 

3)  "How  does  she  extinguish  his  courting  behavior?" 

4)  "How  does  he  extinguish  her  courting  behavior?" 
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61.  When  a student  is  studying  he  may  persistently  scratch  his 
head  and  chew  on  his  pencil.  Frequently  these  behaviors  are 
conditioned  superstitious  operants  resulting  from 

1)  differential  reinforcements  of  successive  approximations 
to  a final  behavior. 

2)  the  pairing  of  unconditioned  stimuli  with  conditioned 
stimuli. 

3)  accidental  contingencies  of  reinforcement. 

4)  continuous  reinforcement  of  high  probability  behavior. 

62.  If  the  child  who  first  calls  all  books"green"  is  not  reinforced 
for  saying  "green"  to  books  of  other  colors,  he  will  soon  call  only 
green  books  "green".  Thus,  books  of  other  colors 

1)  come  to  provide  S°'s  for  the  response  of  saying  "green". 

2)  become  SA  for  the  response  of  saying  other  colors. 

3)  come  to  provide  SA's  fgr  the  response  of  saying  "green". 

4)  become  unconditioned  SU  for  eliciting  the  response  of 
saying  "green". 

63.  Which  of  the  various  schedules  of  reinforcement  do  the  following 

statements  describe:  There  is  a pause  after  reinforcement. 

Responding  then  resumes  and  is  positively  accelerated  until  it 
reaches  a terminal  steady  rate  before  next  reinforcement. 


64. 


1)  fixed-interval 

2)  variable-ratio 

3)  variable-interval 

4)  fixed-ratio 

A pigeon  has  been  given  food  each  time  it  pecks  a disk.  Later  when 
food  no  longer  follows  pecking,  the  rate  in  emitting  the  response 
is  observed  to  decline  gradually.  This  process  is  called 


1)  operant  conditioning 

2)  extinction 

3)  respondent  conditioning 

4)  adaption 


PSYCHOLOGY  QUESTIONNAIRE 


Please  answer  each  of  the  following  questions  by  circling  either  "yes" 
or  "no"  as  they  apply.  Answer  every  question  as  truthfully  as  possible. 


Yes 

No 

1. 

Have  you  ever  been  taught  in  school  anything  about 
psychology  (besides  the  materials  which  you  have  just 
completed) ? 

Yes 

No 

2. 

Have  you  ever  read  a book  about  psychology  (besides  the 
materials  which  you  have  just  completed)? 

Yes 

No 

3. 

Have  you  ever  read  a magazine  article  about  psychology? 

Yes 

No 

4. 

Is  anyone  you  know  a psychologist? 

Yes 

No 

5. 

Have  you  ever  visited  a psychological  laboratory? 

Yes 

No 

6 . 

Have  you  ever  talked  to  a psychologist  about  psychology? 

Yes 

No 

7. 

Have  you  ever  watched  a psychologist  work? 

Yes 

No 

8. 

Is  your  father  or  mother  a psychologist? 

Yes 

No 

9. 

Do  you  ever  talk  about  psychology  at  home  with  members 
of  your  family? 

Yes 

No 

10. 

Did  you  know  what  a psychologist  did  before  you  read  the 
materials  you  just  completed? 

Yes 

No 

11. 

Would  you  like  to  read  more  about  psychology? 

Yes 

No 

12. 

Would  you  rather  read  a regular  'textbook  on  psychology 
than  have  it  taught  by  the  materials  you  have  just 
completed? 

Yes 

No 

13. 

Did  you  like  the  way  psychology  was  taught  in  the 
materials  you  have  just  completed? 

Yes 

No 

14. 

Would  you  like  to  become  a psychologist? 
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NAME  (optional)  Grade Sex 


Please  check  the  following  items  which  apply: 

A.  Please  check  one : 

(1)  I did  not  participate  a_t  all  in  the  study. 

(2)  I participated  for  only  a few  days  in  the  study. 

(3)  I participated  for  most  of  the  time,  but  I did  not 

finish  reading  the  material. 

(4)  I finished  reading  the  material,  but  I did  not  complete 

the  questionnaire  or  the  test. 

(5)  I finished  the  material  and  the  questionnaire,  but  I did 

not  take  the  test. 

(6)  I completed  all  parts  of  the  study. 

Comments  (if  any) : 


B.  If  you  did  not  finish  the  study  please  check  the  reason  (or  reasons) : 

(1)  I was  absent  from  school  too  much  of  the  time. 

(2)  Regular  school  work  kept  me  too  busy. 

(3)  My  parents  did  not  want  me  to  participate. 

(A)  I felt  that  the  study  was  a waste  of  my  time. 

(5)  I just  did  not  want  to  participate. 

(6)  I wanted  to  finish,  but  the  study  was  discontinued  before  I 

was  done. 

(7)  Other  (please  explain)  ■ 


Comments  (if  any) : 


C.  If  you  were  requested  to  participate  in  a similar  study  sometime  in  the 
future,  would  you? 

(1)  Yes,  I would  participate. 

_____  (2)  No,  I would  not  participate. 

(3)  I do  not  know  if  I would  participate  or  not. 

Comments  (if  any) : 

D.  Some  people  have  suggested  that  some  form  of  a reward  be  given  to 
students  who  do  participate  in  studies  such  as  the  one  just  completed. 
Such  a reward  might  be  an  additional  priviledge  at  the  school  or  some 
kind  of  a present  such  as  an  LP  album  or  a pass  to  a movie  theater. 

Do  you  think  that  a reward  should  be  given  to  those  who  participate? 

Yes  No 


-129 


If  you  do,  what  should  it  be: 


E.  Would  you  like  to  see  the  materials  which  you  read  become  part  of 
the  regular  course  work  at  Nova  High  School? 

Yes  No  

Did  you  learn  anything  about  psychology  from  participating  in  the 
study? 

Yes  No 


F. 


APPENDIX  D:  DATA 


Tables  28  through  34  contain  the  data  used  in  the  study.  Table  28 
contains  the  data  for  students  in  Version  1 of  the  material.  Table  29 
contains  the  data  for  Version  2,  and  so  on.  The  data  included  are  listed 
below,  keyed  according  to  the  columnar  headings  in  the  tables: 

Id.#  - An  arbitrary  number  assigned  to  each  student.  It  is  not 
the  school-assigned  student  number. 

Version  - The  number  of  the  material  version  as  described  in  Chapter  II. 
Group  - The  letter  representing  the  physical  grouping  of  each  student 
used  for  scheduling. 

Sex  - 0 represents  male  and  1 female. 

Grade  - 0 represents  tenth  grade  and  1 represents  eleventh  grade. 

Time  - The  time  in  minutes  in  which  each  student  completed  the 

materials . 

Error  - The  errors  made  by  each  student. 

R - the  actual  number  of  errors 
% - the  percentage  of  errors 

T - T-scores  based  on  the  mean  and  standard  deviations  of 
the  students  in  each  version 

SCAT  (Perc.)  - SCAT  Verbal,  Quantitative,  and  Total  scores  reported  in 
percentiles  based  on  Florida  norms. 

SCAT  (T)  - SCAT  Verbal,  Quantitative,  and  Total  scores  in  which  the 

percentiles  have  been  converted  to  normalized  T-scores. 
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Status  - The  completion  status  of  each  of  the  students  according 
to  the  following  code: 

0 - did  not  start 

1 - participated  1 or  2 sessions 

2 - participated  most  of  the  time,  but  did  not  finish 

3 - finished  reading,  but  did  not  take  the  test 

4 - finished  all  parts  of  the  study 

Additional  information  on  the  data  is  presented  in  the  text  — especially 


in  Chapter  II. 


Data  for  Students  in  Version  1 of  the 
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